Abstract 

Executive Order 12003 requires that all federal activi- 
ties reduce their energy consumption by 20% from fiscal year 
1975 use by fiscal year 1985. The Defense Energy 
Information System for shore activities (DEIS II) provides a 
system of measuring, reporting, and comparing energy use 
levels in pursuit of that goal. 

The use of regression and time series models of energy 
use was examined for application within the present DEIS II 
system. A data base of monthly electricity use, gross floor 
areas, four weather variables, and building area category 
identification codes was used in a framework study of 12 
Naval Regional Medical Centers. Specific methodologies for 
model development, interpretation, and application to a 
control system are demonstrated and discussed. 

A. BACKGROUND 


Prior to 1973, energy was considered an inexhaustible 
and expendable supply. It was an inexpensive commodity. 
During the period between 1950 and 1973 the price of most 
energy products increased less rapidly than the price of 
other commodities. In fact, in terms of 1976 dollars, the 
real price of energy was actually declining. The real price 
of residential electricity fell by 55% ($20.30/MBTU to 
$8.94/MBTU), that of gasoline by 21% (63.0 cents/gal to 
49.7 cents/gal), and that of natural gas by 20% ($2.01/MBTU 
to $1.60/MBTU) 
[Ref. 1]. A large population of observers reasoned that 
energy demands are determined by technological progress and 
economic growth. The implicit assumption was that the price 
of energy had a negligible impact on the use of energy. 

As predicted by the first law of economic demand, 
decreasing energy prices encouraged consumption. By 1973, 
bargain priced electricity had increased in per capita use 
by 350%. Less dramatic declines in natural gas prices 
resulted in per capita consumption increases of 60% 
[Ref. 2]. The 
American automobile became heavier and more powerful as fuel 
economy became less of a consideration. 

Building construction was also affected by the price of 
energy. Buildings were designed and constructed primarily 
with initial costs in mind. Construction materials were not 
selected on the basis of insulation qualities. Aesthetics 
favored large window areas. Siting was based on factors 
other than the benefits of solar position. The result was 
the creation of a vast inventory of commercial buildings 
which, by today's standards, utilize excessive amounts of 
energy. Commercial buildings today account for approximately 
8% of our national energy use [Ref. 3]. This present 
inventory is being replaced at the rate of only 2-3% per 
year. The majority of existing buildings for many years to 
come will be those which were not originally designed with 
energy conservation in mind. 

The 1973 Arab oil embargo abruptly ended declining real 
energy prices. The effect on supply and the soaring energy 
prices shocked the world, dispelling the belief in an inex- 
haustible and expendable supply. Moreover, the belief in 
vertical supply and demand curves was recognized as erro- 
neous. With the sharp rise in energy prices, energy use 
declined. In accord with the second law of economic demand, 
the response was greater in the long run than in the short 
run. The initial response was relatively weak as lights 
were turned off, thermostats reset, and fewer miles were 
driven. In the two year period from 1973-1975, per capita 
use decreased by 7%; national energy use dropped by 5% 
[Ref. 4]. Long range response, however, required more 
capital investment and time for implementation. Additional 
building insulation, installation of more fuel efficient 
machinery, and purchase of smaller cars were a few of such 
longer range responses. 

More than an applied lesson in economic laws, the fuel 
crisis demonstrated how reliant American society had become 
on foreign oil. Such reliance was recognized as a threat to 
the national security. The federal government initiated 
programs to become less dependent upon, if not totally inde- 
pendent from, foreign oil. Such programs called for sharp 
reductions in use and the development of alternate energy 
sources. But such a national appeal could not overlook the 
fact that the federal government itself was the largest 
single energy user in the nation, consuming more than the 
combined total of the five largest private users of 
energy--U.S. Steel, Union Carbide, Gulf Oil, International 
Paper, and General Motors Corporation. FY 1978 federal use 
accounted for 2.2% of the national use [Ref. 5]. The 
federal appeal to conserve energy precipitated a "physician, 
heal thyself" role. 


B. ISSUES 


To achieve an objective, one must first define the 
objective. Setting goals is very important in meeting the 
objective. The objective must first be realistic. If set 
too low, the system will not realize its full potential. If 
set too high, the system may even waste resources and effort 
trying to achieve the unattainable. It may tend to 
circumvent the objective, realizing its inability, and fail 
to attain even an optimal response. Instrumental in achieving 
the ultimate objective is setting targets, or goals, to 
direct progress and efforts. Such goals must also be 
realistic to properly drive the intended system response. 

Once goals have been established, monitoring provides a 
means of assessing the realities of the goals and 
objectives. The effectiveness and efficiency of 
accomplishment also become major items of concern. 

These general principles are applicable to the federal 
energy conservation effort. The federal government, being a 
large energy consumer, must play a major role in achieving 
national energy reductions. DOD is in a similar role within 
the federal government. In FY 1975, total federal energy 
use was over 1800 million MBTU. Buildings and facilities 
accounted for almost half of such use at almost 900 million 
MBTU. DOD use represented over 81% of the total federal 
energy use. Its buildings accounted for over 67% of the 
total federal building energy use. Consequently, strong 
attention has been given to reducing energy use in federal 
buildings and facilities. 

Initially, federal goals and objectives were established 
relative to total energy use in FY 1973. As reporting and 
monitoring systems were refined, short range targets gave 
way to formalized planning. A formal plan was officially 
instituted by President Carter with the issuance of 
Executive Order 12003 of July 20, 1977. The order defined 
FY 1975 as the baseline year, expressed objectives and 
targets in terms of energy use per square foot of floor 
space (MBTU/SF), and specified target percentage reductions 
through FY 1985 within the federal sector. Defense Energy 
Program Policy Memorandum (DEPPM) No. 78-2 of 1 March 1978 
formally 
implemented these goals within DOD. The following goals for 
building use were directed: 

1. A 20% reduction in energy use in existing buildings 
(construction completed) by FY 1985. 

2. A [...]% reduction in energy use in new buildings 
(construction completed after the date of promulgation). 

Defense Energy Program Policy Memorandum (DEPPM) No. 
80-6 of 3 June 1980 extended energy goals through FY 2000. 
These goals set not only total energy use reductions but 
also targets for the percentage of use by energy source, 
phased in 5-year targets. The following goals, relating to 
building energy use, were specified relative to the FY 1975 
baseline: 


By FY 1985: 
1. 20% reduction in total energy use. 
2. 30% reduction in natural petroleum fuels use. 
3. 10% of total use from solid fuel conversion sources, 
   i.e., coal, solid waste, wood, etc. 
4. [...]% of total use from renewable sources, i.e., 
   geothermal, wind, solar, low-head hydropower, etc. 

By FY 1990: 
1. 25% reduction in total energy use. 
2. 35% reduction in natural petroleum fuels use. 
3. 15% of total use from solid fuels conversion. 
4. 5% of total energy from renewable sources. 

By FY 1995: 
1. 30% reduction in total energy use. 
2. 40% reduction in natural petroleum fuels use. 
3. 20% of total use from solid fuels conversion. 
4. 10% of total use from renewable sources. 

By FY 2000: 
1. 35% reduction in total energy use. 
2. 45% reduction in natural petroleum fuels use. 
3. 20% of total use from solid fuels conversion. 
4. 20% of total use from renewable sources. 


To put the extent of this application in perspective, 
twenty-two federal agencies owned about 3.2 billion square 
feet of floor space in more than 490,000 buildings worldwide 
when the Executive Order was issued. DOD was the largest 
single owner with over 394,000 (80.6%) of the total federal 
buildings, totalling more than 2.1 billion square feet (68%) 
of the federal building floor space in FY 80. The total DOD 
building energy use was 69% of the total use in all federal 
buildings [Ref. 6]. 

The results to date of federal, DOD, and DON efforts 
have been less than expected. Table 1 shows the results of 
these efforts to date. 

TABLE I 

Federal and DOD Energy Reductions 
In Buildings and Facilities 

[Table body illegible in source. Recoverable structure: 
columns by fiscal year through FY 80; rows for target 
reductions (2% rising to 10% by FY 80), federal reductions, 
DOD reductions, and DON reductions.] 

The reasons for this lack of satisfactory progress need 
to be examined and corrected if the overall 20% objective is 
to be reached. One explanation might be that the goals are 
unrealistic, either in the short range, or in the final 
objective, or both. Another explanation might be that the 
implementation methods are not effective or efficient enough 
to produce the intended results. One obvious, although 
impractical, solution for meeting the energy goals is to 
channel greater capital investments to the bases to 
eliminate the most inefficient plant inventory and replace 
it with more modern, energy efficient construction, or to 
make major conservation-effective alterations. Obviously, 
the capital resources required will be a significant factor 
in such decisions. Less desirable actions, such as reducing 
base operations or even closing down bases, could provide 
other alternatives. Such actions assume that the goals are 
inflexible and realistic in nature. With the assumption that 
the energy goals are realistic and feasible, attention is 
directed to the effectiveness of the implementation process. 
Can the current means of accomplishment be expected to 
achieve the objective, or are modifications necessary to 
improve the response? 


C. PURPOSE AND SCOPE 


The purpose of this thesis is to develop more effective 
methods of evaluating progress toward achieving energy 
conservation goals. Although the present system is quite 
functional, its simplicity, permitting some level of control 
for the vast shore activity complex, has inherent problems 
that can distort reported results. The potential impact has 
financial consequences in that capital investment decisions 
can be adversely affected by such distortions. The limited 
capital resources available to undertake corrective projects 
can thus be less than effectively channeled. 

The methods presented in this thesis are a framework 
that is intended to be general enough to be applied at the 
activity level, or functional command level, such as major 
claimants. The intent of this thesis is to develop and 
discuss the realistic applications of energy use models. A 
brief presentation of theory will introduce the reader only 
to the basic concepts needed to understand and interpret 
the methodology. A comprehensive treatment of theory will be 
left to textbooks. In order to be applied by the corporate 
body of energy managers, the methodology cannot be too 
technical or dependent on theoretically oriented decisions. 
The intent here is to show the major elements of useful 
models to suggest working tools for general application. 

The use of computers with software to handle the 
extensive regression and time series calculations is 
implicit in this methodology. The system used in this 
development was the IBM 370 hardware with the MINITAB 
software package. MINITAB is a terminal oriented statistical 
package developed at Pennsylvania State University, and 
was found to be flexible and easy to use without extensive 
previous knowledge of computers. The method developed herein 
is not dependent on MINITAB, however, and is readily 
adaptable to various statistical software packages. 

The objective of the proposed methods is to identify a 
means of forecasting energy use to improve the effectiveness 
of the energy management program. The models developed will 
have advantages and disadvantages in particular situations 
that will determine actual uses. Discussion of model 
characteristics will demonstrate such applications. 

The methods will be demonstrated within the framework of 
electricity use at Naval Regional Medical Centers (NRMCs). 
This narrowing of scope will allow a more concentrated focus 
on the methods, without undue confusion created by 
introduction of indigenous issues. Selection of electricity 
as the energy type was based on its major, and still 
increasing, percentage of total energy use. Selection of the 
NRMCs as the sample was made on the basis of the particular 
characteristics of an NRMC, such as hours of operation, 
uniformity in mission and capabilities, separate siting at a 
location, concentration of spaces within one building or 
complex of buildings, and their geographical distribution 
affording sampling from different weather zones. 


D. SPECIFIC RESEARCH QUESTIONS 


There are three major questions that will be addressed 
in this thesis. The first question is: what is the impact 
of seasonal weather variations on energy use in Naval 
Regional Medical Centers? Related to this question is the 
impact of such effects on established baselines and 
subsequent energy use evaluations for NRMCs, as well as 
application to other types of activities. 

The second question is: what are the effects of 
categories of use on total energy use? It is intended to 
determine whether particular functional uses of a building 
can be identified with a standard coefficient of use for 
each type of functional use. 

The final question is: can the results be used to 
determine a method for establishing a control system to 
identify realistic goals and actual energy conserved? 


E. METHODOLOGY 


Development of proposed methods of forecasting energy 
use will be accomplished in two phases. The first phase 
will develop forecasting models for energy consumption using 
regression models and time series models. Models will be 
developed for each of the individual activities as well as 
for the total study group. 

The regression model will assume four predictor 
variables: average monthly temperature, monthly heating and 
cooling degree days (sum of the departures of daily mean 
temperatures from the base 65 degrees Fahrenheit), and 
monthly precipitation totals. Humidity was felt to be an 
additional factor of consideration. This data was not as 
readily available from the Naval Weather Service Detachment, 
Asheville, N.C., of the National Oceanic and Atmospheric 
Administration in their normal Station Climatic Summary 
publication. Since the emphasis herein is on methodology 
vice numerology, it was felt that such data was beyond a 
reasonable request at this time for the courtesy service 
provided. 
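
The degree-day definition used for the predictor variables 
can be illustrated with a short sketch. It is an 
illustration only: the function name degree_days and the 
sample temperatures are hypothetical, and the calculations 
in this study were run in MINITAB rather than Python. 

```python
def degree_days(daily_mean_temps, base=65.0):
    """Return (heating, cooling) degree days for one month.

    Each day contributes the departure of its mean temperature from
    the base: days below the base add to heating degree days, days
    above the base add to cooling degree days.
    """
    heating = sum(max(base - t, 0.0) for t in daily_mean_temps)
    cooling = sum(max(t - base, 0.0) for t in daily_mean_temps)
    return heating, cooling


# Hypothetical daily mean temperatures (degrees F) for a short period.
sample_days = [60.0, 65.0, 70.0, 72.5, 58.0]
hdd, cdd = degree_days(sample_days)
print(hdd, cdd)
```

A day exactly at the base temperature contributes to neither 
total, so a mild month can show small values for both. 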

Both types of models will be used to develop trends of 
energy use per square foot from the established FY 75 
baseline. The models will be used to compare projected FY 81 
use with actual use. Forecasts of FY 82 use will be shown. 
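
The comparison of projected with actual use can be sketched 
as follows. This is a simplified, assumed example with a 
single predictor (cooling degree days) and invented data; 
the models in this study use four weather variables, and the 
names fit_line and percent_deviation are hypothetical. 

```python
def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def percent_deviation(actual, projected):
    """Percent by which actual use differs from the model projection."""
    return 100.0 * (actual - projected) / projected


# Hypothetical baseline months: cooling degree days and electricity
# use per square foot (arbitrary units chosen to lie on a line).
baseline_cdd = [0.0, 100.0, 200.0, 300.0]
baseline_use = [10.0, 12.0, 14.0, 16.0]

intercept, slope = fit_line(baseline_cdd, baseline_use)

# Project use for a later month with 250 cooling degree days and
# compare it with a hypothetical metered value.
projected = intercept + slope * 250.0
actual = 14.0
deviation = percent_deviation(actual, projected)
print(round(projected, 2), round(deviation, 2))
```

A negative deviation indicates use below what the baseline 
relationship would predict for the same weather. 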

The second phase of analysis will attempt to determine 
standard coefficients of use for electrical consumption by 
category code of functional use. Data analysis will be 
performed by regression of average monthly electrical 
consumption on area of functional spaces. The main 
emphasis of this phase will be on the baseline data. The 
development of standards of use for specific category codes 
will be shown as a means of adjusting baseline data for 
changed building uses. 
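
A minimal sketch of such a regression is given below for 
the case of only two functional categories. The category 
split, the areas, and the consumption figures are invented 
for illustration, the no-intercept form is an assumption, 
and the function name is hypothetical. 

```python
def fit_two_category_coefficients(areas, use):
    """Least squares fit of use = c1 * area1 + c2 * area2.

    areas is a list of (area1, area2) pairs, one per activity, and
    use is the matching list of average monthly consumption. The
    2 x 2 normal equations are solved directly by Cramer's rule.
    """
    s11 = sum(a1 * a1 for a1, _ in areas)
    s12 = sum(a1 * a2 for a1, a2 in areas)
    s22 = sum(a2 * a2 for _, a2 in areas)
    t1 = sum(a1 * u for (a1, _), u in zip(areas, use))
    t2 = sum(a2 * u for (_, a2), u in zip(areas, use))
    det = s11 * s22 - s12 * s12
    c1 = (t1 * s22 - t2 * s12) / det
    c2 = (s11 * t2 - s12 * t1) / det
    return c1, c2


# Hypothetical activities: (clinical area, administrative area) in
# thousands of square feet, with average monthly electricity use.
areas = [(100.0, 50.0), (200.0, 30.0), (150.0, 80.0)]
use = [225.0, 415.0, 340.0]

clinical_coef, admin_coef = fit_two_category_coefficients(areas, use)
print(clinical_coef, admin_coef)
```

Each coefficient is read as use per unit of floor area for 
that category, which is the form needed to adjust a baseline 
when space changes from one use to another. 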

Finally, the results will be evaluated to demonstrate 
useful applications. In particular, use of the developed 
forecasting techniques in a control system will be 
demonstrated. 


F. THESIS OVERVIEW 


Chapter I introduces the reader to the background of the 
federal energy challenge and the issues involved in setting 
and meeting energy conservation goals. Chapter II discusses 
the energy conservation program for buildings and facilities 
within the Naval Shore Establishment with regard to the 
energy conservation approach, the inherent implementation 
problems, and the resources available to achieve the desired 
results. Chapter III presents the methodology proposed in 
the development and analysis of models of energy use for the 
sample NRMCs and total study group to illustrate the 
application of the developed models. Standards of energy 
consumption for categories of functional use and their 
application for a modular baseline are examined in Chapter 
IV. Chapter V, Summary and Conclusions, reviews the problems 
of the present system of evaluating energy reduction 
progress, and discusses the advantages of the particular 
energy use models within a control system framework. 

Appendices A-M summarize the model development for each 
of the sample sites and total study group. The parallel 
structure of these appendices will facilitate a comparison 
of results. A summary of forecasts by time series models is 
presented in Appendix N. A summary of data is provided in 
Appendix O. Statistical tables are given in Appendix P as a 
means of ready reference. 

Thus far, it has been shown what the building energy 
conservation goals are. Assuming that the goals are 
realistic and mandatory, this section will describe the 
implementation of these goals within DON. It will show the 
approach taken by DON to impose the goals, the actions by 
the implementing commands, and some inherent problems in the 
implementation. 


A. THE APPROACH 


The basic approach taken by Executive Order 12003 was to 
establish a total use standard and to specify a percentage 
reduction target. Such a percentage reduction was in turn 
allocated to the various departments. DON in turn allocated 
such target percentages to its fleet commands, which 
continued the suballocation by percentage goals to their 
component commands, or activities. This is a simple and 
basic approach, and assumes that every command is capable of 
identifying its energy uses and can take the necessary steps 
to reduce its energy use. By virtue of the size and 
complexity of the Naval Shore Establishment, and to an even 
larger extent DOD, the approach to energy conservation has 
to be one that is relatively simple. To get too specific 
and exacting would require an even greater bureaucratic 
network than is already required to administer the program 
and would become very difficult to manage effectively. 

The type of energy approach in use can generally be 
termed an end-use restriction. Its primary focus is on 
achieving a particular quantity limitation. It allows a 
relatively simple means of establishing goals, monitoring 
progress, and identifying variance. In short, it provides a 
means of immediate impact. 

There are numerous drawbacks to such an approach, 
however. The National Electrical Manufacturers Association 
(NEMA) makes the following critical analysis of the end-use 
restriction approach [Ref. 7]: 


The extent to which a system is used has no 
bearing on its efficiency. If, for whatever 
reason, a system is inefficient, it will waste 
energy each time it is used. End-use 
restrictions fail to take into consideration the 
systems which produce the end-use product, be it 
heating, cooling, [...]. In other words, end-use 
restrictions tend to ignore the significant energy 
savings which can be realized by making systems 
operate as efficiently as possible. In a similar 
manner end-use restrictions fail to consider the 
fact that every building is a unique system whose 
many systems interrelate. As a result, lowering a 
thermostat in winter can sometimes cause 
consumption of more energy, not less. Likewise, 
removing lamps and luminaires can sometimes cause 
consumption of the energy sources which are in the 
shortest supply. 


By way of contrast, NEMA advocates an alternative 
approach called Total Energy Management (TEM) which focuses 
less on the end-use and more on the efficiency of use. In 
describing TEM in general terms, they state: [Ref. 8] 


In essence, TEM considers every building as a 
unique, complex system. To conserve energy one 
first must understand how the building consumes 
energy; how user needs are met; how the systems 
interrelate; how the external environment affects 
it, and so on. By understanding how a specific 
building consumes energy, one can make energy 
conservation improvements which can be integrated 
into the system itself. Then, when the system is 
used it runs efficiently and therefore uses the 
least amount of energy to get the job done. 
Application of a wide variety of end-use [...] 
modifications--which are an integral element of 
the TEM concept when applied with flexibility-- 
would result in even more savings. 


From a macro sense, end-use goals are necessitated by 
the scope of the buildings involved. From a micro view, 
i.e., the particular bases, a more flexible approach, such 
as presented by TEM, is very feasible. Such an approach 
may or may not already be in use at the individual bases or 
their component commands. Requirements are dictated 
externally only in the setting of heating and cooling 
temperature standards, directing the execution of functions 
necessary to fulfill mission requirements, and guidelines 
for the protection of life, property, and security. 


B. MONITORING 


The Defense Energy Information System (DEIS) was 
established to monitor total energy use within DOD. 
Specifically, DEIS II identifies energy use in the buildings 
and facilities of the shore activities. A monthly report is 
sent by each shore activity which identifies its energy use 
by energy source, i.e., fuel oil, electricity, natural gas, 
etc., unit cost of such energy source, and explanations for 
differences from target. Weather variations are often 
identified as the source of otherwise unexplained 
differences. 

The reports are transmitted by message to NEESA, where 
they are compiled and summarized by activity, type command, 
etc. Feedback summary reports are provided via the Energy 
Audit Report (EAR), showing quarterly results for each 
command, and for each level of command. For example, 
Commander, U.S. Naval Air Forces, Atlantic (COMNAVAIRLANT) 
would receive a summary of the stations under its command. 
The next echelon of command, Commander-in-Chief, U.S. 
Atlantic Fleet (CINCLANTFLT), would receive reports for 
COMNAVAIRLANT as well as the other type commanders (TYCOM) 
and individual stations reporting to it. 


C. MEASUREMENT 


The scientific nature of the reporting and monitoring 
efforts may lead to the erroneous conclusion that 
measurement is precise and fairly routine. To understand 
the limits of such data, the nature of the reporting 
commands and the means of measuring use need closer 
examination. 

A base defines an area, or group of activities under one 
general command. Such a command may be further broken down 
into operational commands. An extreme case would be the 
Naval Operating Base (NOB), Norfolk, Virginia. Within the 
fenced perimeter of the base are two major divisions--the 
Naval Air Station and the Naval Station--identifying air 
support and ship support functions. However, there is no 
clear line of demarcation separating the two functions 
within the fenced perimeter. Supporting activities may be 
located in either or both general areas. Functions such as 
the industrial Public Works Center support all of the on 
base facilities, as well as facilities off base, such as 
family housing and NRMC Portsmouth. Each operational 
command has its own major claimant to which it reports. 
Thus, several major claimants are accountable for different 
portions of NOB's total energy use. There are a large 
number of such on-base individual commands ranging from 
staff headquarters to a Communications Station. Further 
contributing to the complexity is the fact that different 
commands might occupy one building or parts of several 
buildings. Since each command reports its energy use over 
its assigned floor space, measurement of the uses within 
buildings can be difficult. 

By way of contrast to the complexity of NOB, some 
commands occupy a complete building in an off base location, 
as do some NRMCs. Energy use is more easily identified 
and measured in such cases, especially if the building is 
considered as a single metering customer by the power 
company. 

Ideally, each building, if not each component use, would 
be metered so that exact use determinations would be 
simplified. Unfortunately, energy suppliers, such as a 
power company, consider a base a single user. Their 
metering responsibility is only for energy delivered "at 
the gate". Submetering becomes the responsibility of the 
base itself. Off base locations may be metered by the power 
company as a single customer. Family housing, either on- or 
off-base is usually metered per house if billing is done on 
the residential rate schedule. 

The vast number of meters required to submeter specific 
uses represents a large potential cost to the government; 
submetering is thus considered cost prohibitive as yet. A 
pilot program to introduce portable electricity meters to 
measure specific, short term loads is to begin in FY 82. 
The purpose of this program is to gain perspectives on 
actual uses for energy conservation purposes rather than for 
allocation of use, though. 

The alternative to metering is engineering estimates. 
Each subcommand of a base is generally assigned a percentage 
of total base use, based on connected load and estimates of 
actual use. Baselines and monthly uses are assigned 
accordingly. Adjustments can be made if projects or 
management actions can provide justification. This method 
leaves a lot to be desired in terms of accuracy and 
accountability. Since actual use cannot be identified, 
there is no strong incentive to conserve at the individual 
command on a daily basis. There is a dependence on large 
scale correction projects as a result. 
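
The arithmetic of such an allocation can be sketched as 
follows. The sketch assumes shares proportional to connected 
load alone, which simplifies the practice described above; 
the command names, loads, and function name are hypothetical. 

```python
def allocate_use(total_use, connected_loads):
    """Split metered base use among commands by connected load share."""
    total_load = sum(connected_loads.values())
    return {
        name: total_use * load / total_load
        for name, load in connected_loads.items()
    }


# Hypothetical connected loads (kW) and one month of metered base
# use (MBTU) measured at the gate.
loads = {"Command A": 300.0, "Command B": 100.0, "Command C": 100.0}
allocated = allocate_use(1000.0, loads)
print(allocated)
```

Because the shares are fixed, a command that cuts its own 
use lowers every command's assigned figure in the same 
proportion, which illustrates the weak incentive noted 
above. 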

Without specific metering, there is a major problem of 
identifying actual savings. While engineering estimates may 
indicate a level of savings, the effects of the 
interrelationships of building systems may, in fact, be 
energy additive or deductive and overlooked in calculating 
energy use. Thus, the possibility exists of underfunding a 
project which would actually have high energy savings, or 
overfunding a project with low energy savings, on the basis 
of faulty assumptions in the calculations. Portable metering 
as mentioned above may aid in increasing the effectiveness 
of estimation. 


D. EVALUATION 


The energy baselines are a month by month standard based 
on use in FY 75. Some adjustments have been made to reflect 
exceptional conditions that would adversely affect future 
comparisons. Current use is compared to the comparable 
baseline period and a percentage of change calculated. The 
calculated percentage, compared to the target percentage, 
provides a basis for evaluating progress. This comparison 
effort is done at the command level, base level, TYCOM 
level, major claimant level, etc. Statistics, such as shown 
in Table 1, provide a basis for quantitative analysis. From 
Table 1, it can be concluded that federal, DOD, and DON 
progress is below target. 
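
The comparison just described reduces to a simple 
calculation, sketched here with invented figures. The 
function names and all numbers are hypothetical and are not 
taken from Table 1. 

```python
def use_per_square_foot(total_use_mbtu, floor_area_sf):
    """Energy use intensity in MBTU per square foot."""
    return total_use_mbtu / floor_area_sf


def percent_reduction(baseline_intensity, current_intensity):
    """Percent reduction of current use relative to the baseline."""
    return 100.0 * (baseline_intensity - current_intensity) / baseline_intensity


# Hypothetical activity: same floor area in both periods.
baseline = use_per_square_foot(1200.0, 10000.0)  # FY 75 month
current = use_per_square_foot(1100.0, 10000.0)   # comparison month
reduction = percent_reduction(baseline, current)

target = 10.0  # hypothetical target percentage for the period
meets_target = reduction >= target
print(round(reduction, 2), meets_target)
```

In this example the activity shows a reduction of roughly 
8%, short of the assumed 10% target. 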

To make a valid comparison, however, one must understand 
what is being compared. There are particular inherent 
factors in the present measurement system that could 
possibly distort the data and lead to erroneous conclusions. 

One strong influence on energy use is weather. Hot, 
humid weather will cause greater use of air conditioning, 
which is a large electricity user. Cold weather usually 
results in higher fuel oil use, recognizing that steam 
heating is produced by oil fired boiler plants. Electricity 
and fuel oil accounted for almost 72% of total building 
energy use in FY 1975, and 74.5% in FY 1978. The impact of 
weather on these dominant energy sources is then likely to 
be significant. If, however, the baseline year experienced 
severe weather variation, say a long, hot summer, a mild 
summer in the comparison year could overstate the actual 
conservation progress. Conversely, normal weather during 
the baseline month or year and an extreme variation in the 
comparison period could understate actual conservation 
results. While weather effects are significant in the 
magnitude of energy used, a valid comparison requires 
similar weather conditions in order to study controllable 
consumption for evaluation purposes. 
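
The size of this distortion can be illustrated with a small 
worked example. It assumes, purely for illustration, that 
use responds linearly to cooling degree days with a known 
sensitivity; the function name and every figure are 
hypothetical. 

```python
def raw_reduction(baseline_use, current_use):
    """Percent reduction with no allowance for weather."""
    return 100.0 * (baseline_use - current_use) / baseline_use


def weather_adjusted_reduction(baseline_use, current_use,
                               baseline_cdd, current_cdd, use_per_cdd):
    """Percent reduction after restating the baseline at the
    comparison period's weather, assuming a linear response."""
    adjusted_baseline = baseline_use + use_per_cdd * (current_cdd - baseline_cdd)
    return 100.0 * (adjusted_baseline - current_use) / adjusted_baseline


# Hypothetical case: a hot baseline month followed by a mild one.
raw = raw_reduction(1000.0, 900.0)
adjusted = weather_adjusted_reduction(
    baseline_use=1000.0, current_use=900.0,
    baseline_cdd=500.0, current_cdd=400.0, use_per_cdd=0.5,
)
print(round(raw, 2), round(adjusted, 2))
```

Here the unadjusted comparison reports a 10% reduction, 
while restating the baseline for the milder weather leaves 
only about 5% attributable to controllable consumption. 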

A second consideration in the validity of comparisons is 
that the composition and specific natures of military bases 
are constantly changing. Energy use on a per square foot 
basis is an important consideration in allowing for 
construction and demolition changes. However, there is no 
consideration for the uses of buildings. Technology has 
continued to progress along energy intensive lines. Greater 
emphasis on electronics has increased the needs for 
environmental controls, i.e., humidity, temperature, air 
filtration, etc., increasing consumption beyond that needed 
for just equipment use. If new technological installations 
include construction of new facilities to house them, design 
and construction incorporating energy efficiencies could 
offset, at least partially, any increased total use or total 
use per square foot. Construction funding often lags 
technical installations, though, and bases are forced to 
adapt existing facilities to meet the increased 
requirements. For example, aviation trainers have been 
installed in converted warehouse spaces, increasing the 
electrical load with no change in square footage. Such 
conversion of low energy intensive use to a higher intensity 
use will offset other energy savings achieved. 

Building uses are also often changed to correct 
particular facility deficiencies, such as lack of storage 
space. Since construction funds are limited, often with long 
lead times between project submission and project 
completion, demolitions of unnecessary but still usable 
buildings are reluctantly requested. Bases prefer to convert 
buildings no longer necessary for their original purposes to 
contingency uses. The low priority given to warehouse 
construction, often identified as a facility deficiency, 
invites conversion of unused, or marginally used, buildings 
to storage spaces which have a low energy intensity. Such 
buildings are usually older and beyond their expected useful 
life, and highly energy inefficient. Thus, while a base 
might show a statistical improvement, there would be no real 
increase in the efficiency of use. 

The above examples may well be practical and wise manag- 
erial applications of using existing resources, commendably 
saving construction funds. However, all can distort energy 
use comparisons. Setting standards of use by functional 
category of use would permit a modular technique of 
evaluating energy use or adjusting baselines. 

A final potential distortion lies in the equity of energy
baselines. It was previously mentioned that the baseline
year was initially established as FY 73 and later changed
to FY 75. A command which took actions to comply with the
energy reduction goals prior to FY 75 would tend to show
less favorable results in later comparisons with FY 75
than the base which took no immediate
reduction actions. This effect, however, would be minimal 
inasmuch as the measures taken at the time were directed at 
obvious wastes and of relatively small magnitude. 

Overall, the present evaluation system is limited to
providing a snapshot of use in a particular period subject
to a particular set of conditions. Such a snapshot, being
subject to distortions, does not afford a good basis for
comparison purposes. Nor does it permit a sound basis for
control since it does not isolate noncontrollable factors
such as weather. Modifications are needed to provide a more
effective means of evaluating energy consumption relative to
established targets.


E. CAPITAL INVESTMENT CONSIDERATIONS


The energy reductions required to meet the specified 
energy goals will require far more than individual effort to 
turn off lights, limit heating and cooling, or economize
uses. Buildings and installed systems will require major
modifications to overcome the effects of construction during
the era of cheap energy as well as to offset the
deterioration of buildings and their installed equipment
systems. In many cases, complete replacement incorporating
energy efficiency considerations will be the cheaper
alternative. It must also be realized that the
interrelationships of technology, missions, and support
facility requirements have not remained constant since
baseline FY 75. In particular, the rapid growth of and
dependence on electronics has increased electricity use
significantly. As a consequence, actual
reductions will have to be greater than the planned target 
percentages to compensate for any growth. The need for 
capital resources to effect the required changes becomes 
obvious. 

The investment capital required to make such modifica- 
tions is not linearly related to the quantity of energy 
conserved. One author has described energy conservation as
a "depleting resources industry" [Ref. 9]. The economic
theory of depletable resources is generally associated with
the law of diminishing marginal returns, which states that
when production from any activity is to be increased it can
be done by adding capital or labor. As additional
quantities of any one of these inputs are added to a fixed
factor of production, the incremental or marginal returns
associated with the inputs tend to decline. The converse of
this law is the law of increasing costs which states that,
since the input must be purchased, the incremental
(marginal) costs of production increase. In application to
energy conservation, output can be considered as the energy
saved, that is, energy not required to be produced. Inputs
would be the material for conservation capital stocks as
well as the labor costs of installation. A graph of cost
versus energy conserved is shown for the general case in
Figure 1.

It can be seen that there will be a fixed cost (FC) of
energy. As output, conserved energy, is increased, total
costs (TC) will begin to rise at a low marginal cost (MC).
The marginal cost, the rate of change in total costs, will
increase as output increases. Succinctly stated, greater
energy savings incur greater costs at an increasing rate.
Thus, initial conservation takes little capital investment. 
To achieve greater conservation levels, increasing rates of 
capital investment are needed. 
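
The relationship between total and marginal cost can be
illustrated numerically. The sketch below is only an
illustration: the quadratic cost curve, the constants
FIXED_COST and CURVATURE, and the function names are all
assumptions chosen to produce a rising marginal cost, and are
not taken from the cost curve of the figure.

```python
# Illustrative only: a quadratic total cost curve is an assumed shape.
FIXED_COST = 10.0   # hypothetical fixed cost (FC), arbitrary units
CURVATURE = 2.0     # hypothetical constant controlling how fast cost rises


def total_cost(energy_saved):
    """Total cost (TC) of conserving a given quantity of energy."""
    return FIXED_COST + CURVATURE * energy_saved ** 2


def marginal_cost(energy_saved, step=1.0):
    """Marginal cost (MC): added cost of conserving the next `step` units."""
    return total_cost(energy_saved + step) - total_cost(energy_saved)


# Marginal cost of each successive unit of conserved energy.
marginal_costs = [marginal_cost(q) for q in range(5)]
print(marginal_costs)
```

Under these assumed constants each successive unit of
conserved energy costs more than the one before it, which is
the behavior described for the general case.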

The implication of this characteristic is that greater
rates of capital investment are necessary in order to 
achieve the increasing percentages of total reductions 
required. In perspective, with the growth of electricity 
requirements, there is an increasing level of capital 
investment required just to maintain the percentage 
reductions thus far achieved. 

The dramatic rise in energy costs since 1975 provides
offsetting benefits for the capital investment requirements. 
Rising energy costs serve to decrease payback periods. An 
energy savings project can often pay for itself within a few 
months or years in this perspective. Under appropriated
funding, however, the savings are only statistical savings
if payback periods extend beyond the fiscal year. In effect,
savings are more realistically cost avoidance in such cases.
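
The effect of rising energy prices on payback can be shown
with a simple payback calculation. This is a minimal sketch
under stated assumptions: the function name, the project
cost, the annual savings, and the two energy prices are all
hypothetical, and discounting is ignored.

```python
def simple_payback_years(investment, energy_saved_per_year, price_per_unit):
    """Years needed for avoided energy cost to equal the investment."""
    annual_savings = energy_saved_per_year * price_per_unit
    if annual_savings <= 0:
        raise ValueError("annual savings must be positive")
    return investment / annual_savings


# Hypothetical project: $12,000 investment saving 1,000 MBTU per year.
payback_low_price = simple_payback_years(12000.0, 1000.0, 4.0)   # $4 per MBTU
payback_high_price = simple_payback_years(12000.0, 1000.0, 8.0)  # $8 per MBTU
print(payback_low_price, payback_high_price)
```

In this hypothetical case, doubling the price of energy
halves the payback period, although the payback still
extends beyond a single fiscal year.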

There are various sources of investment dollars within 
the federal budget. The response to a specific project 
approval and the funds available vary with the cost of the 
project. Commanding Officers have the authority to fund
projects costing less than $15,000 from their operating
budgets. With no specific approvals required, funding can
be done as needed. This may require sacrifices of other
funding plans. Unless the payback period is short enough to
realize savings within that fiscal year so that savings can
be applied elsewhere, given the expiring nature of
appropriated funding, there is little motivation to make
strong sacrifices of short term needs for the longer range
goals. Even if the cost of purchased energy exceeds budget
allowances, major claimants have no recourse but to fund the
deficiency with little danger of repercussion to the
Commanding Officer.

Energy programs to provide supplemental funds for energy 
conservation projects have been established in recognition 
of funding limitations and the conflicting priorities within 
operational budget funding. The Energy Technology 
Application Program (ETAP) provides funds of up to $100,000 
for retrofit of existing buildings. Program funding is
requested from annual Congressional appropriations. Prior
to FY 82, funding and specific project approvals were
administered by the Naval Facilities Engineering Command
(NAVFAC). Its geographical Engineering Field Divisions
(EFD) served to provide technical expertise to evaluate
and, if requested, initiate projects as well as provide
funding administration within their large geographical
areas. Although NAVFAC provided final approval, the EFD
recommendations were generally heeded. The major commands
have now been tasked with this administration and control of
ETAP funds. Technical guidance will continue as a NAVFAC
function. This shift of control will allow more control of
energy conservation progress by the particular claimants. It
will also bring greater accountability. 

Major claimant control of their own energy destiny may 
also prove a disadvantage to the overall energy plan of DON, 
though. Funds are more likely to be invested in less 
efficient single uses, benefitting individual commands, but 
with less impact on the total DON requirements due to a
decreased range of alternatives available to major
claimants. Each major claimant, if given a proportional
share of the funds available to distribute among its total 
commands, will have less flexibility in magnitude of 
funding. Funding of a wider range of smaller projects can 
be expected, thereby missing opportunities for optimal 
funding in a wider range of greater savings projects. 


Energy conservation projects to retrofit existing 
buildings costing over $100,000 fall under the Energy 
Conservation Investment Program (ECIP). As these funds are
within the Military Construction Program (MILCON), they are
administered by NAVFAC. Projects are submitted via the EFDs 
for technical approval and forwarded to NAVFAC as a line 
item budget submission for Congressional funding approval. 
Project approvals are based on economic justifications and 
subject to the budgetary politics of the federal budget 
process. With the requirement for Congressional approval, 
the response time between project submission and completed
construction is slowed to a minimum of three fiscal years.

While ETAP and ECIP funding provide capital investment
funds to effect energy conservation corrections, it must 
also be recognized that the political realities of 
appropriation funding tend to limit the amount of funds 
available. Cost avoidance is not an attractive proposition 
to the political entrepreneur. Voters receive no tangible 
benefits in energy conservation projects and see only an 
increase in government spending. Actual cost reductions are 
hidden in the amount of continuing rate increases. As a 
result, conservation funds are subject to the "budget axe"
when budget reductions are necessitated. The energy
investment programs are in fact more saleable under the
patriotic banner of energy independence from foreign
sources.

The combined effect of these factors has put a large
strain on investment dollars. The net result of the
diminishing marginal returns nature of energy conservation
is to increase the need for investment dollars.
Coincident with this need, the dollars made available tend
to be limited and restricted. The more decentralized
control of investment funds at the ETAP level may limit the
effectiveness of investment funding further. The message
then is very clear: investment decisions will have to be
made wisely to achieve the maximum effectiveness from the
dollars available. Accordingly, rational decisionmaking
will depend on the accuracy and reliability of the
information on which it is based.

A. MODEL TYPES AND USES


The decisionmaker, for purposes of this thesis, will be
defined as the person evaluating the energy reduction
progress of an activity or group of activities with access
to capital investment funds. The issues for the activity
decisionmaker are how to effect the necessary energy
reductions, what investment funds are needed to implement
further reductions, how effective the implemented measures
have been, and what level of reduction can be realistically
expected relative to the prescribed targets. A higher
echelon command, such as the major claimant, would be
confronted with allocating its limited investment dollars.
Both decisions would necessitate evaluation of the current
and expected progress. 

The basic document to provide current progress is the 
Energy Audit Report (EAR). Figure 2 is a sample of such a 
report. The evaluator can read electricity use in MWH and 
MBTU for the baseline and current 12 month periods, the 
percent change from FY 75, and percentage of total use. It
further shows FY 75 and current fiscal year floor areas and
percent change from FY 75, MBTU/thousand square feet
comparisons, and a comparison of total reduction and target
reduction. EARs are provided to individual activities on
their use as well as to higher echelons, summarizing the
progress of their reporting commands.

In a simplistic, cursory examination of the EAR, one
could assess whether an activity was above or below target. 
One could also identify the progress by energy type. In 
this example, the activity relied on electricity for almost
one third of its energy use but had only achieved a 2.7%
reduction in such use, offsetting the large percentage 
decreases in its other energy sources. 

Taking an EAR at its face value might well lead to the 
erroneous conclusions discussed in Chapter II. But the 
energy manager has little else to base his conclusions on 
except to review past EARs and develop an intuitive or
perhaps a graphical picture of the general energy reduction
progress. Would a 12% increase this fiscal year have really
indicated noncompliance? Or were temporary phenomena
responsible? Does the 10% decrease indicate compliance? An
example of this issue is provided by a review of electricity
use at NRMC Jacksonville as shown in Figure 3. A sharp peak
for May 1978 is evident. A comparison of FY 78 with FY 75
shows only a 5% decrease, corresponding to a target of 6%.
The EAR, by itself, would not show the use spike. The 1% 
variance might not be considered too serious. However, 
visual inspection of the data would raise questions 
concerning the real decrease percentage. 

In verification of the data with the activity, it was 
learned that new water chiller units were being tested 
during that month, "probably" causing the spike. The 
activity might have reasoned that without this created extra 
load, it would have achieved an 8% reduction instead of the
reported 5% decrease. Without knowing the real effect of 
the additional use for the water chillers, such reasoning 
might be erroneous. A large expected use, perhaps due to 
weather conditions, would have shown less than a 3% differ- 
ence due to water chillers. A low expected use might have 
shown more than a 3% increase. In either case, the energy 
manager could have evaluated the situation and taken
appropriate action as a result.

The implication of this type of framework is the
existence of a control system. But such a control system is
dependent upon the ability to determine expected use, rather
than setting a target, directing effort toward that goal,
and suffering the blurring effects of noncontrollable
factors.

Predicting future events, or forecasting, with relative 
certainty has become a scientific discipline in the last 
several decades, particularly with the widespread
introduction and use of computers. Programs have been made
readily available for almost all quantitative forecasting
techniques. "The need for forecasting is increasing as
management attempts to decrease its dependence on chance and
become more scientific in dealing with its environment"
[Ref. 10]. Techniques vary considerably with the situation,
time horizons, factors involved, assumptions made, and types
of data
patterns. A basic underlying premise in almost all fore- 
casting methods is the assumption of constancy, i.e., the 
pattern of the past will continue into the future. 

There are two major types of forecasting models:
regression and time series models. Regression models assume
the factor to be forecasted has a continuing relationship
with one or more independent variables. The purpose of the
model is to exploit those relationships and use them to
forecast future values of the dependent variable. System
responses are observed and input relationships of that
response are developed. To use Ohm's Law as an
illustration, voltage (E) can be predicted by inputs of
current (I) and impedance (Z), and the known relationship
E = I x Z. The key to this method is identifying the
significant input variables (stimuli) and their relationship
to the output (response).

The time series model, on the other hand, makes no
effort to explain the factors that affect system response,
treating the system as a black box. Its main concern is
observing particular events and predicting future states.
Weather can be forecast by use of a time series model. The
pattern of historical events is described by mathematical
relationships and used to forecast expected temperatures,
sunlight, and even rainfall, without necessarily
understanding the climatic conditions responsible for the
results. Both models have particular advantages in
particular applications and will be examined in this thesis
for application to energy forecasting at Naval Shore
Activities.
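
The black box character of a time series forecast can be
shown with a very simple case. The sketch below assumes a
seasonal averaging rule, which is only one elementary way of
carrying a past pattern forward and is not necessarily the
time series technique applied later in this thesis. The
function name and the sample figures are hypothetical.

```python
def seasonal_average_forecast(history, season_length=12):
    """Forecast one full season ahead from complete past seasons.

    Each forecast value is the average of the observations that
    occupied the same position (e.g., the same month) in earlier
    seasons. No explanatory variables are used.
    """
    if not history or len(history) % season_length != 0:
        raise ValueError("history must contain whole seasons")
    seasons = [
        history[start:start + season_length]
        for start in range(0, len(history), season_length)
    ]
    return [sum(values) / len(values) for values in zip(*seasons)]


# Hypothetical quarterly electricity use for two past years.
past_use = [10.0, 20.0, 30.0, 40.0, 12.0, 22.0, 32.0, 42.0]
forecast = seasonal_average_forecast(past_use, season_length=4)
print(forecast)
```

The forecast relies entirely on the assumption of constancy:
nothing in the calculation refers to weather, occupancy, or
any other cause of the observed pattern.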


B. REGRESSION MODELS


1. Concepts of Electricity Use


In this section, an energy forecast model based on
four weather variables will be developed using regression 
techniques. The main purpose of this section will be to 
demonstrate the techniques and discuss the interpretations 
of the regression model. A more specific application of 
regression techniques will be developed in Chapter IV to 
identify the relationship of energy use and functional uses 
of activity buildings.

In order to build a model to 'explain' electricity
use, it will be useful to define some of the basic terms and
concepts of electricity consumption that will be important
considerations. A relatively simple model of residential
home use provides a useful framework for discussing basic
concepts. Each electrical device is rated as to the power
it uses to perform its function. The rating is given in
terms of watts. The electricity used during that
performance is the product of the power rating and the hours
of use, expressed in terms of watthours. A 60 watt light
bulb left on for 8 hours consumes 480 watthours, or .48
kilowatthours. The British Thermal Unit (BTU) is often used
to express energy use in terms of a common denominator
relative to other energy sources, such as steam, coal, or
fuel oil. One kilowatthour is equivalent to 11,600 BTUs
(.0116 MBTU), after adjustment for particular distribution
losses. The previously cited example of the light bulb
consumption could be expressed equivalently as 5,568 BTUs.
The amount of power (watts) required to energize all devices
connected to a system is its connected load. A residential
home use could be calculated by summing the watthours of
each use of its connected load. In reality, the calculated
amount would be less than the actual amount used since every
unit has some associated energy loss, or inefficiency. The
losses in a typical residential unit are relatively minor,
however. The energy efficiency of a device or a system is
defined as the percentage of useful energy (rated watts x
hours - losses) to delivered energy (rated power x hours).
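
The light bulb arithmetic can be reproduced in a few lines.
In this sketch the conversion factor of 11,600 BTUs per
kilowatthour is the one implied by the example (5,568 BTUs
for .48 kilowatthours). The function names and the loss
figure used for the efficiency calculation are hypothetical.

```python
BTU_PER_KWH = 11600.0  # factor implied by the light bulb example in the text


def kilowatthours(rated_watts, hours):
    """Energy used by a device: power rating times hours of use."""
    return rated_watts * hours / 1000.0


def btus(kwh):
    """Express kilowatthours in BTUs using the factor above."""
    return kwh * BTU_PER_KWH


def efficiency_percent(delivered_kwh, losses_kwh):
    """Useful energy as a percentage of delivered energy."""
    return 100.0 * (delivered_kwh - losses_kwh) / delivered_kwh


bulb_kwh = kilowatthours(60, 8)   # 60 watt bulb left on for 8 hours
bulb_btu = btus(bulb_kwh)
# Hypothetical losses of .024 kilowatthours for the same use.
bulb_efficiency = efficiency_percent(bulb_kwh, 0.024)
print(round(bulb_kwh, 2), round(bulb_btu), round(bulb_efficiency, 1))
```

Summing the same calculation over every device in the
connected load gives the calculated use of a residence.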

A simple residential home model would be expected to 
contain explanatory variables relating the effects of 
weather, number of occupants, and the hours that the unit is 
occupied. The model would express the amount of change 
expected in the electricity consumption per unit change in 
any one of the variables, or stimuli. To control the 
response within particular limits, it would be necessary to 
control the variables, if possible, or adjust the system
response to a variable. Since weather is not controllable, 
adjustment of the system response could be effected by 
increasing the insulation, resetting thermostat controls, or 
even replacing inefficient heating or cooling units. The 
model would indicate the change in electricity use caused 
by a unit change in any one of the variables when the other
variables remain constant. A user could select the
variable(s) for control that would influence the desired
response.

Models for industrial electricity, such as for a
military installation, are considerably more complex and
involve subsystem interrelationships. Power to the
installed loads is often three phase power. Simplistically,
the total power to the load is provided by several wires
which provide the total required power. It is important to
the efficiency of the power use that each phase have the
same loading. A system is said to be balanced if each phase
delivers the same amount of power to a system. As the
system becomes unbalanced, the efficiency of use decreases.
This concept applies to an entire building supplied with
three phase power, a particular internal circuit, or a
particular type of load. The efficiency of a building's use
can be improved by shifting loads to other circuits to gain
a better balance. 

The power factor is another important item in energy 
efficiency. The power factor is the cosine of the phase 
angle between the current and voltage, ranging in absolute 
value between 1 and 0. A pure capacitive, or pure inductive 
load will have a phase difference of 90 degrees, and there- 
fore a power factor (cos 90 degrees) of 0. The product of 
power factor and rated power is used to determine the useful
or real power used relative to the power delivered. Thus, a
load with a power factor of 1 would have no losses as a
result of the phase difference, while a load with a power
factor of 0 would be a total loss of delivered power. A
pure capacitor, for example, would store energy delivered to
it and would have a power factor of 0. By adding an
inductive load of equal magnitude, the resultant system
power factor would increase to 1. Thus, another industrial
energy conservation measure is to attempt to balance
inductive and
capacitive loads. In fact, power companies often give rate 
incentives to industrial customers who maintain a power 
factor of at least 0.8 as a means of increasing the power 
system's overall efficiency. 
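
The effect of balancing inductive and capacitive loads on
the power factor can be sketched as follows. The terms
apparent power and reactive power (in vars) are standard
electrical usage but are not used elsewhere in this thesis.
The function names and load values are hypothetical.

```python
import math


def power_factor(phase_angle_degrees):
    """Cosine of the phase angle between current and voltage."""
    return math.cos(math.radians(phase_angle_degrees))


def combined_power_factor(real_watts, inductive_vars, capacitive_vars):
    """Power factor of a system after reactive loads partly cancel.

    Inductive and capacitive reactive power are opposite in sign,
    so only their difference adds to the apparent power.
    """
    net_reactive = inductive_vars - capacitive_vars
    apparent = math.hypot(real_watts, net_reactive)
    if apparent == 0:
        raise ValueError("system draws no power")
    return real_watts / apparent


# Hypothetical load: 800 watts real power, 600 vars inductive.
unbalanced = combined_power_factor(800.0, 600.0, 0.0)
# The same load after adding 600 vars of capacitive load.
balanced = combined_power_factor(800.0, 600.0, 600.0)
print(round(unbalanced, 2), round(balanced, 2))
```

With these assumed values the uncorrected load sits at the
0.8 level mentioned for rate incentives, and adding a
capacitive load of equal magnitude raises the power factor
to 1.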

The major concept to be understood by the reader is 
that system interrelationships are significant in explaining 
the system total response. A change in one component of a 
system may in fact produce an effect opposite to the
intended response. Relative to weather effects, warm
weather resulting in the use of large air conditioners may
increase or decrease the efficiency of a particular system
or subsystem. Thus, a simple calculation of the rated power
use of a particular load may not necessarily provide the
true overall response. Consequently, the definition of an
industrial system response needs to consider the
interrelationships of its subsystems. Determining the system
response is not a simple matter of inventorying connected 
loads. 


2. Concepts of Statistical Theory 


This thesis does not presuppose a strong background 
in statistics, nor is it the intent to provide one.
Textbooks suitable for the reader's particular background
would provide a more thorough and appropriate source of
information. The text of this thesis is written for the
corporate body of energy managers who may have had only
minor exposure to a study of statistics. Consequently, a
brief presentation of important statistical theory concepts
will be given only to provide a better basis of
understanding the interpretation and application of
regression modeling.

Regression models were seen to be based on determining
a system response in terms of the relationships of
particular input variables, or stimuli. Theoretically,
there exists a true function that completely explains
output. Such a deterministic model would also be capable of
forecasting future values providing the definitive equations
did not change, i.e., system interrelationships and input
variables remained valid. A true deterministic model of
electricity use would detail every use down to the effi- 
ciency of the connected loads and the effect of energized 
loads on load balance, power factors, etc. Obviously, such 
a model would be difficult to develop and very complex. 
Despite the ability of computers to rapidly compute the 
response once the appropriate data is input, collection of 
that data can be costly in terms of time and money. A 
lesser model might be sufficiently accurate and far less 
costly. A useful model would be one which could reliably 
estimate the ‘true’ function and be practical enough to be 
effective. 

Since the applied model is not deterministic, there
will always be error, or differences between the expected
values of the model and the observed data. The technique of
least squares is used to minimize the error of a fitted
curve to the input data. The sum of the squares of the
errors becomes a measure of determining how well the
estimator curve fits the data. The coefficient of
determination, or R2, is the ratio of 'explained' variation
to the total variation. Its values then would range from 0,
indicating no relationship between the stimuli and response,
to 1, indicating a perfect fit. Maximizing R2 is one means
of determining the usefulness of a model.
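
A least squares fit and its R2 can be computed directly for
the single variable case. This is a minimal sketch: the
function names are hypothetical, and the data are invented
figures standing in for monthly cooling degree days and
energy use, not data from this study.

```python
def fit_line(x, y):
    """Least squares intercept and slope for y = b0 + b1 * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def r_squared(x, y, intercept, slope):
    """Share of the total variation in y explained by the fitted line."""
    mean_y = sum(y) / len(y)
    total = sum((yi - mean_y) ** 2 for yi in y)
    unexplained = sum(
        (yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y)
    )
    return 1.0 - unexplained / total


# Invented data: cooling degree days and energy use for five months.
degree_days = [100.0, 200.0, 300.0, 400.0, 500.0]
energy_use = [12.0, 13.9, 16.2, 18.1, 19.8]

b0, b1 = fit_line(degree_days, energy_use)
r2 = r_squared(degree_days, energy_use, b0, b1)
print(round(b0, 2), round(b1, 4), round(r2, 3))
```

Because the invented data lie close to a straight line, the
resulting R2 is near 1. Real monthly data would normally
show more scatter and a lower value.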

The addition of variables to a regression model
cannot decrease the R2 value, appearing to improve the
prediction capability of the model. However, an increase
may be spurious, caused by measurement errors, or chance
results of unrelated data. A plot of R2 against the numbers
and combinations of variables will result in a logarithmic
type curve which flattens as it approaches a maximum.
Adding variables in this region will provide little increase
in R2.

Another means of evaluating the fit of the model may
be made by examination of the mean squared error (MSE). As
its name implies, MSE is computed by squaring the individual 
error for each data point and then finding the average or 
mean value of the sum of those squares. The MSE gives 
greater weight to large errors than to small errors since 
the errors are squared before being summed. Plotting the 
MSE against the number of variables results in a parabolic 
curve. The graph will show a minimum point for a particular 
combination and number of variables. A decrease in the MSE
with added variables may not be great enough to justify the
cost of additional data collection. 

The variance of a function, known as S2, indicates 
the dispersion of the data around the mean. This value is 
used to assess the extent of possible error. The F 
statistic uses the ratio of the 'explained' variance to the
'unexplained', or totally random, variance about the mean.
If there was no correlation of the output variable with the
input variables, the mean of observed output would be the
best predictor. Since all variance would be 'unexplained',
the ratio should be 1. As more variance was explained, the
numerator would increase. The F statistic, with a known
probability distribution, can be used to compare the
explanatory power of different models or assess the various
characteristics of a particular model.

Estimating the 'true' function, recognizing that
error is involved, leaves room for doubt that a model has
predicted a realistic value. Confidence levels are used to
express the probability that a predicted value is accurate
relative to the 'true' value. A confidence level would
indicate the percentage of times, on the average, that a
particular outcome would be observed in repeated trials. 
For confidence intervals, the desired outcome is that the 
computed interval include the ‘true’ value. Increasing the 
confidence level causes the interval to be longer, but less 
informative. All calculations used in this thesis use a 95% 
confidence level. 

The confidence level is calculated based on known 
distribution characteristics. The cumulative area under 
such a curve is the cumulative probability. The central 
limit theorem states that as the sample size n increases, 
the distribution of the mean of a random sample taken from
practically any population approaches a normal distribution. 
A standardized normal distribution, with its mean of 0, and 
standard deviation of 1, can then be used as a prototype 
from which to make certain statistical and probability 
inferences. A useful property of the standardized normal 
distribution is that about 95% of the area, or probability,
lies within 2 standard deviations of the mean. The stand- 
ardized normal curve and table of values is shown in 
Appendix P. 
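
The 95% property of the standardized normal distribution can
be checked without a table. The sketch below uses the
NormalDist class from the Python standard library (Python
3.8 or later). The function and variable names are
hypothetical.

```python
from statistics import NormalDist

# Standardized normal distribution: mean of 0, standard deviation of 1.
standard_normal = NormalDist(mu=0.0, sigma=1.0)


def probability_within(k):
    """Area under the curve within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)


within_two = probability_within(2.0)
# Number of standard deviations that encloses exactly 95% of the area.
z_for_95 = standard_normal.inv_cdf(0.975)
print(round(within_two, 4), round(z_for_95, 2))
```

The area within 2 standard deviations is slightly more than
95%, and the exact 95% limits fall slightly inside 2
standard deviations, which is why "about 95%" is the usual
statement.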

The Student's t, or simply t, distribution is used
for small samples. The t distribution is actually a family
of curves identified by degrees of freedom. Degrees of
freedom relates to the number of variables in an equation.
The residuals for the variables, i.e., the errors or
differences between observed and calculated values, are said
to be 'free' in that they can be any value, with the
condition that their sum is zero. Thus, not every residual
is free once other residuals have value. Computer printouts
usually print degrees of freedom as a normal output of
interest. Tables of the t distribution, such as provided in
Appendix P, will yield a standardized deviation from the
mean.
Multiplication of this value by the model's standard
deviation will yield the deviation from an expected value
that will be exceeded with the tabled probability. It can
be seen that the
t distribution and standard normal distribution values 
converge for large numbers of observations. 

The aptness of a model may also be determined by 
examining its prediction capability. A condition of aptness 
is that error is randomly distributed with constant variance
around the mean over a range of values. A plot of residuals
against the individual variables will demonstrate error
distribution. Figure 4 demonstrates possible patterns of
residual error when plotted against a variable. Figure 4
(a) illustrates constant variance (homoscedasticity) over
the range of X, satisfying the basic assumption. Figures 4
(b), (c), and (d) exhibit non-constant variance
(heteroscedasticity). Use of a variable exhibiting the
characteristics of any of the latter three examples will
introduce nonrandom, focused error into particular data
ranges and have undesired influence within the model.
Correction of heteroscedasticity may be accomplished by
transformation of the
variables. Residual analysis, other than noting that parti- 
cular variables may have undesirable influence without 
adjustment, is beyond the scope of this writing, however. 

Multicollinearity, or correlation between independent
variables, can also produce undesirable effects in a
model. At best, these effects are unpredictable. 
Multicollinearity should be avoided, if possible. 
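
A simple screen for multicollinearity is to compute the
correlation between each pair of candidate variables before
fitting. The sketch below is one such check under stated
assumptions: the function name is hypothetical and the
weather figures are invented for illustration.

```python
import math


def pearson_correlation(a, b):
    """Correlation coefficient between two equal-length series."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    covariance = sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b))
    spread_a = sum((ai - mean_a) ** 2 for ai in a)
    spread_b = sum((bi - mean_b) ** 2 for bi in b)
    return covariance / math.sqrt(spread_a * spread_b)


# Invented monthly figures for three candidate weather variables.
average_temp = [50.0, 60.0, 70.0, 80.0, 90.0]
cooling_degree_days = [0.0, 20.0, 160.0, 450.0, 760.0]
precipitation = [3.1, 1.2, 4.0, 2.2, 3.0]

temp_vs_cdd = pearson_correlation(average_temp, cooling_degree_days)
temp_vs_precip = pearson_correlation(average_temp, precipitation)
print(round(temp_vs_cdd, 2), round(temp_vs_precip, 2))
```

In the invented data, average temperature and cooling degree
days are highly correlated, so using both in one model would
invite the unpredictable effects described, while
precipitation is nearly independent of temperature.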

Figure 3.3 Prototype Residual Plots


C. REGRESSION MODELING TECHNIQUES

1. Parameter Selection

A regression model will have the general form,

Y = b0 + b1 X1 + b2 X2 + ... + bk Xk + error (e)

The initial selection of variables as predictors of system
response depends on the purpose of the study, logical
assumptions of relationships, the practical cost/benefit
considerations of data availability, and the desired
complexity of the model. Selection of the weather variables
for this analysis provides amplification of these points.
The purpose of a weather parameter study would be to test
and compare weather effects on energy use. The general form
of the weather model that will be developed is given by,

MBTU/SPF = b0 + b1 AVGTEMP + b2 HDD + b3 CDD + b4 PRECIP



The results of this model will assist in answering pertinent 
questions, such as, Does electricity use show a strong 
correlation with weather factors? Which weather factors 
have the strongest influence? Is there a general relation- 
ship between activities in the same climate zone, or even 
among particular types of activities? More important, 
though, is what effect does the model have on management
decisionmaking? 
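
As a minimal sketch of how a coefficient in such a model is estimated, the following fits the single-predictor case, Y = b0 + b1 CDD, by ordinary least squares. The function name and the monthly values are hypothetical and are not taken from the study data; the full four-variable model would require a multiple regression routine such as the one in MINITAB.

```python
def fit_line(x, y):
    """Ordinary least squares for one predictor: y = b0 + b1 * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    b1 = sxy / sxx               # slope: change in use per cooling degree day
    b0 = mean_y - b1 * mean_x    # intercept: use when CDD is zero
    return b0, b1


# Hypothetical monthly values, constructed as use = 30 + 0.02 * CDD.
cdd = [0, 50, 200, 400, 550, 300]
use = [30.0, 31.0, 34.0, 38.0, 41.0, 36.0]

b0, b1 = fit_line(cdd, use)
print(round(b0, 2), round(b1, 4))
```

Because the illustrative data contain no error term, the fit recovers the constructed intercept and slope; real monthly data would leave residuals to be examined as described in the preceding section.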

An important consideration to realize before final
selection of the variables is that the final results can be
applied only within the context in which they were
determined. The final model will show only a relationship
between the input variables. For example, cooling degree
days may show a strong influence in a particular model.
That influence may only be relevant in the presence of the
other selected variables. The addition of another variable,
such as hours of sunlight, may decrease the relative
significance of cooling degree days and increase the
significance of precipitation. Selection of parameters for
study should begin with broad consideration and then narrow
in focus.

Availability of data and cost of the data are important
for obvious reasons. The acquisition of data, in terms of
time and money, must be cost effective. It makes little
sense to acquire data that requires many manhours to
assemble and refine without a strong improvement in a model
and a high value placed on such improvement. For example,
total year weather summaries, published for each weather
station, were used to collect the necessary weather data
used in this study for each activity. Selection of a wider
variety of variables would have required additional
processing costs by the National Climatic Center, and a
greater compilation effort prior to computer processing.
The benefits of additional data were considered less than
the cost, and the data were therefore not collected.

The accuracy of data must also be considered within
a cost context. Regression is often used in cost estimating
large government contracts, such as the acquisition of
weapons systems, ships, and aircraft. Historical data to
the nearest penny requires a great deal of research; 'ball
park' estimates defeat the purpose of forecasting cost. A
reasonable amount of accuracy is determined by the cost of
acquiring data and the purpose such accuracy will serve.
Similarly, the floor areas of reporting activities must be
recognized as having limited accuracy. EARs show only a
single area for a fiscal year. Allowing for the possibility
of a net effect of zero during the year from construction
and demolition combinations, and for the sometimes lengthy
transition period between full occupancy and inactive status
for new buildings or building demolitions, an attempt to
refine square footage to a completely accurate number would
have been cost ineffective, since the final results may show
only a small change for accuracy adjustments. It is highly
unlikely that records would be kept to such a level of
refinement. Records pertaining to actual
energizing/deenergizing dates and occupancy status would be
an exception rather than the rule. However, the user should
realize the limits of the data and the effect on the final
model before expecting too much from the model.
Alternatively, model uses can be specified to ensure the
data will be of appropriate accuracy.


2. Building The Model 


Although graphical comparison of system response and
various system stimuli is not necessarily an integral part
of regression modeling, it does provide a visual indication
of particular patterns and trends that may suggest certain
relationships. Parts A and B of Appendices A-M exhibit
graphs of energy use and weather conditions from FY 75
through FY 81. The various energy use graphs show a variety
of patterns. NRMC Corpus Christi, for example, shows
distinctly different patterns before and after the start of
FY 79. Prior to FY 79, there were relatively small
variations between successive peaks. The largest peaks seem
to occur about July of each year. The mean MBTU/KSF
decreased through FY 76, stabilized in FY 77, and rose in
FY 78-79. The July peaks after the start of FY 79 show a
sharp increase, although winter minimums also show a
significant decrease. It can also be seen that there was a
milder winter in the baseline year. In fact, there was a
mild winter, relative to successive years, at almost all of
the sample sites. The composite study group also reflects
this occurrence.

An EAR provides only yearly energy use summary
statistics. The averaging of the wide variations shows only
a relative degree of change of the mean. The visual picture
of the monthly energy use might prompt a different type of
response from the decisionmaker. One might speculate that
the activity added large air conditioning units, causing the
large summer peaks. Without such air conditioning use, it
is possible that the remaining use level had decreased, as
indicated by the winter lows. The decisionmaker, even with
confirmation of these hypothetical explanations, would be
unable to assess at this point whether the increase was
justifiable, assuming the necessity of the air conditioning.
Did weather conditions, i.e., unusually hot, humid
conditions, cause greater use? Or is air conditioning use
excessive?

The results of the four weather variable model for
NRMC Corpus Christi are shown in part C of Appendix C. The
resultant equation is seen to be,


Y = 29.7 - 0.0075 AVGTEMP - 0.0086 HDD + 0.0169 CDD + 0.246 PRECIP
   (1.47)  (-0.02)          (-0.67)      (1.46)       (1.23)


Interpretation of this equation is that for a unit change in
any one of the input variables, system response will change
by the signed magnitude of the variable coefficient while
the remaining variables remain constant. The magnitudes of
the coefficients alone, however, do not indicate their
importance, since the units of each variable are different.
Relative significance is determined by the t-ratio
(coefficient/standard deviation). T-ratios are shown in
parentheses below the coefficients. CDD is seen to be the
strongest variable by this criterion. Recalling that the
equation is an estimator of the 'true' function, the user
might want to test the hypothesis that the individual
coefficients are 0. That is, is the estimated coefficient
nonzero because of the error in the estimate? To determine
whether the value is significantly different from 0 with 95%
confidence, an evaluator would compare the observed t ratio
to the tabulated t distribution. Since each ratio is less
than 2, the value of 0 as the true value for each
coefficient would be accepted.
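
The comparison just described can be sketched in a few lines. The t-ratios are those reported for the four weather variables of the NRMC Corpus Christi model; the function names, the dictionary layout, and the use of 2.0 as a rounded critical value are illustrative assumptions rather than part of the study's procedure.

```python
# t-ratios reported for the NRMC Corpus Christi four-variable model.
T_RATIOS = {"AVGTEMP": -0.02, "HDD": -0.67, "CDD": 1.46, "PRECIP": 1.23}


def significant_terms(t_ratios, critical=2.0):
    """Return the variables whose |t| exceeds the critical value."""
    return [name for name, t in t_ratios.items() if abs(t) > critical]


def rank_by_strength(t_ratios):
    """Order the variables from largest to smallest |t|."""
    return sorted(t_ratios, key=lambda name: abs(t_ratios[name]), reverse=True)


print(significant_terms(T_RATIOS))   # variables judged different from 0
print(rank_by_strength(T_RATIOS))    # strongest variable first
```

Ranking by absolute t-ratio, rather than by raw coefficient, is what allows variables measured in different units to be compared.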







The t test can be applied only to a single
coefficient. Even though none of the coefficients have a t
statistic greater than the critical value of 2.0, it cannot
be said that all coefficients are 0, since deletion of one
variable could change the relative significance of the
remaining variables. To test the hypothesis that all
coefficients are 0 simultaneously, a similar test can be
performed with the F statistic. In the case of NRMC Corpus
Christi, F is computed as 16.9 (498.94/29.53). The F table,
as provided in part C of Appendix P, would be used. Entry of
the degrees of freedom of the 'explained' error (4) and the
degrees of freedom of the 'unexplained' error (79) yields a
value of between 5.66 and 5.69. Since the computed F of 16.9
exceeds this tabled value, the hypothesis that all
coefficients are zero is rejected at the 95% confidence
level. It would thus be concluded that not all coefficients
are 0.
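
The arithmetic of the F test is brief enough to sketch directly. The two mean squares are those quoted above for NRMC Corpus Christi, and the tabled value is the upper end of the range read from the F table in the text; the function and variable names are illustrative assumptions. A computed F above the tabled value is the usual basis for rejecting the hypothesis that all coefficients are zero.

```python
def f_statistic(ms_explained, ms_unexplained):
    """Ratio of the explained mean square to the unexplained mean square."""
    return ms_explained / ms_unexplained


F_TABLED = 5.69  # upper end of the tabled range quoted in the text

f_value = f_statistic(498.94, 29.53)
reject_all_zero = f_value > F_TABLED

print(round(f_value, 1), reject_all_zero)
```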

The next step of analysis would be to delete the
weakest relative variable, in this case average temperature,
to narrow the focus of consideration. However, the fit of
the model by R2 and MSE criteria needs to be examined first,
so that the relative changes in other models can be
compared. The R2 value is seen to be 46.1%, which will be
the maximum for up to these four variables. It is also, as
expected, an indication of a weak fit. (A value on the order
of 90% would be hoped for in this type of prediction model.)
The MSE (mean square of the residuals in the analysis of
variance) is seen to be 29.5. Also shown in the analysis of
variance is a listing of fourteen points which were
identified from the total of 84 as having adverse effects
either by a large standardized residual (R) or a large
influence by an X value (X).
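
As a cross-check, the R2 value can be recovered from the analysis of variance figures already quoted for NRMC Corpus Christi, since each sum of squares is its mean square multiplied by its degrees of freedom. The sketch below assumes only that standard relationship; the function name is illustrative.

```python
def r_squared(ms_explained, df_explained, ms_unexplained, df_unexplained):
    """R2 = explained sum of squares / total sum of squares."""
    ss_explained = ms_explained * df_explained
    ss_unexplained = ms_unexplained * df_unexplained
    return ss_explained / (ss_explained + ss_unexplained)


# Mean squares and degrees of freedom quoted in the text.
r2 = r_squared(498.94, 4, 29.53, 79)
print(round(100 * r2, 1))  # percent of variation explained
```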

The correlation summary indicates a further problem
of multicollinearity. A correlation of .956 between average
temperature and CDD indicates that the variations in the two
variables are almost identical and move in the same
direction. The -.884 correlation between average temperature
and HDD indicates these two variables are also strongly
correlated but move in opposite directions. That is, as
average temperatures rose, CDD increased and HDD decreased.
These results, as previously discussed, were expected.
Elimination of average temperature to reduce the
unpredictable effects of multicollinearity would further be
suggested.
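
A correlation summary of this kind can be sketched as follows. The temperatures are hypothetical, and the degree days are crudely approximated from the monthly mean against an assumed 65 degree base over 30 days, purely to reproduce the expected signs; none of the values are from the study data.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical monthly mean temperatures (degrees F).
avg_temp = [56, 60, 66, 73, 79, 84, 86]
# Rough approximation: degree days from the monthly mean, 65 degree base.
cdd = [max(t - 65, 0) * 30 for t in avg_temp]
hdd = [max(65 - t, 0) * 30 for t in avg_temp]

print(round(pearson(avg_temp, cdd), 3))  # strongly positive
print(round(pearson(avg_temp, hdd), 3))  # negative
```

Correlations this close to +1 or -1 between predictors are the signal that one of the pair adds little independent information to the model.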


3. Evaluation of ‘Best’ Model 


Each of the models which have been developed for
four weather variables is considered to be the full model.
That is, it contains all of the variables intended. As seen
earlier, it will have the best fit since adding variables
will always improve the R2 value. But the full model may not
always be the 'best' model. The criterion for 'best' is
established by the decisionmaker based on the intended model
use.

Prior to considering a model for selection, a
determination of aptness should be made. It will be assumed
herein that all models are apt, since residual analysis is
more appropriate to discussion of developing realistic
models rather than the methodological approach taken by this
thesis. The weather variables selected for this study are
known to be interrelated and of little practical value by
themselves.

Values of R2 and MSE are the most commonly used
indicators of fit. Criteria to be used will depend on the
decisionmaker's needs and preferences. As previously
discussed, a curve of R2 against combinations of variables
will generally show a sharp rise initially before flattening
as it approaches a maximum value. The 'elbow rule' criterion
sets the selection of 'best' at the point where the flat
zone begins. Beyond that point, R2 will change very little
for adding variables.

The parabolic curve of MSE against variable
combinations will decrease to a minimum point and begin to
rise again past the 'best' point. The degree of change as
MSE approaches the minimum may also be very slight, though,
indicating that the addition of a variable will decrease MSE
by only a small amount. The minimum point may not be the
same point indicated by the R2 'elbow rule' criterion.

The F statistic also shows the explanatory power of
a model. The F statistic has no maximum or threshold value
criterion. A minimum criterion can be set based on the
confidence level test that the coefficients are not all 0,
as discussed earlier. The minimum point determined for a 95%
confidence level was found to be 5.68. Below this F level,
the model would be rejected.

An important concept, basic to either criterion,
however, is cost effectiveness. The cost of a model must be
less than its value. Cost effectiveness should decide the
complexity of the model. All else being equal, model
selection should be decided by optimizing cost
effectiveness.

Table 2 summarizes the results of regressions for
each of the 15 combinations of models for one to four
variables of each sample. The R2 and MSE values are shown
for the best model in each category of 1, 2, 3, or 4
variables. Continuing the use of NRMC Corpus Christi for
illustrative purposes, a 'best' single variable model would
use CDD as a predictor, with R2=44.3% and MSE=29.42. A
'best' two variable model would add precipitation as a
predictor with a corresponding gain in R2 of 0.9%, with a
decrease of 0.16 in MSE. A three variable model, adding HDD,
was found to increase R2 by 0.9%, with a decrease in MSE of
0.10. The addition of average temperature as a fourth
predictor had no noticeable effect on R2, but increased the
MSE by 0.37. The F statistic shows a continuing decline as
variables are added, although staying above the minimum
significance level. Assuming little cost difference between
a 2 variable and a 3 variable model, the 'best' overall
model would appear to be the three variable model 13.
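
The Corpus Christi comparison can be laid out as a short calculation. The single variable values are those quoted in the text; the two, three, and four variable values are derived here by applying the stated gains and losses in sequence, so they are a reconstruction rather than figures copied from Table 2. The variable names are illustrative.

```python
# R2 (percent) and MSE for the best model with 1, 2, 3, and 4 variables.
# Entries beyond the first are derived from the changes stated in the text.
R2 = {1: 44.3, 2: 45.2, 3: 46.1, 4: 46.1}
MSE = {1: 29.42, 2: 29.26, 3: 29.16, 4: 29.53}

# Marginal gain in R2 from each added variable (the 'elbow' view).
r2_gain = {k: round(R2[k] - R2[k - 1], 1) for k in (2, 3, 4)}

# Number of variables at which MSE is smallest.
min_mse_size = min(MSE, key=MSE.get)

print(r2_gain)
print(min_mse_size)
```

Both views point at the three variable model here, though as noted above the two criteria need not agree in general.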

Table 2 provides a basis for making some general
observations. Selection of the 'best' models based on
changes in, rather than absolute values of, R2 and MSE, and
on the minimum F value criterion, would have made the three
variable model the most popular. It is interesting to note
that CDD figured as the best predictor in all but 4 of the
13 single variable models, in all but 1 of the two variable
models, and in 10 of the three variable models. Nonselection
of a CDD model was then due to marginal differences in
almost every case. The idea that more may not be best was
clearly demonstrated. Full models for Oakland and
Jacksonville, as well as the three variable model for
Oakland, were rejected on the basis of F test significance.

The approach to model selection would generally be
the same for the activity level and upper echelon
decisionmakers. However, the issue of cost effectiveness
would be greater at the upper echelon command. The activity
level decisionmaker could be relatively indifferent to the
difference in complexity between a three and four variable
model, or between a two variable and a three variable model.
However, an upper echelon decisionmaker evaluating a group
of reporting commands would find the complexity difference
multiplied by the number of reporting activities. In this
study, the decisionmaker would have to consider the cost of
acquiring and processing twelve sets of data against the
value of increasing the R2 value by 0.1, with almost no
change in MSE, and a moderate decrease in the value of F.
The purposes of the information would be a large influence
in the selection. It would be expected that the three
variable model would be selected.

Regression modeling can be seen to be an involved
process. Its value is in the determination of relationships
between system response and specific inputs. The
relationships would be useful for developing a plan of
corrective action. For example, the strong effects of
cooling degree days would suggest investigation to determine
ways of reducing such effects. An activity may decide to
alter circuits to gain a better balance in phase loading,
and/or power factor corrections. The effect of air
conditioning could make a difference during the cooling
season and be opposite in effect during the remainder of the
year. Biannual load adjustments might then be appropriate to
compensate for such effects. Increased insulation against
heat losses or gains might have cumulative effects during
heating and cooling seasons. Initiation and prioritization
of projects to implement necessary changes might be
effected. Similarly, an upper echelon command might review
its priorities of funding based on the indications of
particular regression models.

Evaluation of projects, however, is perhaps most
significant at the EFD technical review. The estimation of
energy savings and payback periods provided by the
engineering staff is critical to the funding of projects in
many cases. Erroneous assumptions of system
interrelationships could well provide the difference between
funding or not funding a project that could either save
energy or waste investment capital. Regression analysis
provides a further tool that can be useful in assisting such
decisionmaking.


D. TIME SERIES MODELS


Regression analysis provides a capability of examining 
relationships to define system output. Time series models, 
on the other hand, are concerned only with system output. 
The model defines historical patterns and trends of a vari- 
able by mathematical equations which it uses to forecast 
future states. The assumption in both cases is that the 
patterns and trends will continue into the future. 

A particular value of this method is in the ability to
minimize the effects of unusual events and base forecasts on
expected states. One of the concerns expressed for the
present system of evaluating energy reduction progress is
the susceptibility of energy use to noncontrollable weather
conditions. The strong relative influence of cooling degree
days on electricity use has been shown. Although a definite
factor of use, its influence would be seen in a comparison
between months of differing temperatures. Comparison would
be enhanced by accounting for a difference in such
noncontrollable factors. A review of the monthly electricity
use and weather summaries in parts A and B of Appendices A-M
illustrates definite patterns.

NRMC Great Lakes shows a specific example of the
occurrence of unusual weather variations. The average
temperature in January 1977 was 15.5 degrees lower than the
same month in the FY 75 baseline. A corresponding increase
of 519 heating degree days (44.7%) in FY 77 is then seen.
Comparing just the peak cooling degree days, disregarding
the month of occurrence, also shows an increase of 38
cooling degree days, almost 10% greater, in FY 77. A
comparison of energy uses would certainly be influenced by
these noncontrollable factors and somewhat distort an
appraisal of energy conservation progress. FY 75 seems to
have had a milder winter at almost every one of the sample
sites. The total group summary also reflects this fact.
The evaluator needs a means of comparing uses that allows
consideration of such effects. By basing its forecasts on
historical trends and patterns, the time series model
provides a standardized, and perhaps more meaningful, basis
of comparison.


1. Testing For Time Series 





Autocorrelation is used to describe the association
or mutual dependence between values of the time series at
different time periods. It relates a series for different
time lags. A pattern in a plot of residuals may imply
autocorrelation. One statistical test for the existence of
autocorrelation is the Durbin-Watson, or D-W, test. The
Durbin-Watson distribution and tabled values are shown in
Part D of Appendix P. Upper and lower D-W values of D-W(u)
and D-W(l) are read for the appropriate number of
independent variables (k) and sample size (n). The
distribution curve, symmetrical around 2.0, is divided into
five intervals: (1) less than D-W(l), (2) between D-W(l) and
D-W(u), (3) between D-W(u) and 4 - D-W(u), (4) between
4 - D-W(u) and 4 - D-W(l), and (5) more than 4 - D-W(l). If
the D-W value is in interval (1) or (5), it is likely that
autocorrelation is present. The test is inconclusive if the
test value is in intervals (2) or (4).
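
The five-interval rule can be expressed as a short sketch. The statistic is computed in the standard way, as the sum of squared successive differences of the residuals divided by the sum of squared residuals. The function names and the wording of the returned labels are assumptions made for illustration; the bounds 1.49 and 1.68 in the usage lines are the values quoted in this study for 4 independent variables and 84 observations.

```python
def durbin_watson(residuals):
    """Sum of squared successive differences over sum of squared residuals."""
    numerator = sum((residuals[t] - residuals[t - 1]) ** 2
                    for t in range(1, len(residuals)))
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator


def classify_dw(d, d_lower, d_upper):
    """Place a D-W value in one of the five intervals described above."""
    if d < d_lower:
        return "autocorrelation likely (positive)"     # interval (1)
    if d <= d_upper:
        return "inconclusive"                          # interval (2)
    if d < 4 - d_upper:
        return "no autocorrelation indicated"          # interval (3)
    if d <= 4 - d_lower:
        return "inconclusive"                          # interval (4)
    return "autocorrelation likely (negative)"         # interval (5)


# Residuals that alternate in sign give a statistic well above 2.
d = durbin_watson([1.0, -1.0, 1.0, -1.0])
print(d, classify_dw(d, 1.49, 1.68))
```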

The D-W test value for each sample is shown in part
C of Appendices A-M. For 4 independent variables and 84
observations, D-W(l)=1.49 and D-W(u)=1.68. Thus, for this
study, if the D-W statistic is between 1.68 and 2.32, that
is, between D-W(u) and 4 - D-W(u), it would be concluded
that there is no autocorrelation and time series modeling
would not be indicated. Camp Lejeune was the only activity
in the no autocorrelation interval; San Diego data proved
inconclusive. All other activities indicated definite
autocorrelation.

In this section, the basic concepts needed to understand
the premises of time series analysis will be presented
before their application to modeling techniques.

The general form of the regression model was established
as,

Y = b0 + b1 X1 + b2 X2 + ... + bk Xk + error (e)


A similar concept in time series analysis is autoregression,
which relates past values of a dependent variable to itself,
i.e., auto (self) regression. The general form of the
equation would then be,

Y(t) = b0 + b1 Y(t-1) + b2 Y(t-2) + ... + bk Y(t-k) + e


A time lag is the time interval offsetting the variable
being forecast. The term Y(t-2) indicates a time lag of 2
from Y(t). Weather data would have a time lag of 12 months,
indicating a 12 month repeating pattern.

Autoregression differs from regression in that the
residuals of the independent time series variables, i.e.,
Y(t-1), Y(t-2), etc., usually depend on each other. The
number of independent variables to include in a time series
is more difficult to determine in autoregression.
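
To make the lag notation concrete, the sketch below arranges a series into the rows an autoregression of order k would be fitted on, pairing each Y(t) with Y(t-1) through Y(t-k). The function name is an illustrative assumption. Note that the first k observations have no complete set of predecessors and so produce no rows.

```python
def lagged_rows(series, k):
    """Pair each Y(t) with its k predecessors Y(t-1), ..., Y(t-k)."""
    rows = []
    for t in range(k, len(series)):
        predictors = tuple(series[t - lag] for lag in range(1, k + 1))
        rows.append((predictors, series[t]))
    return rows


# A short illustrative series with k = 2.
for predictors, target in lagged_rows([1, 2, 3, 4, 5], 2):
    print(predictors, "->", target)
```

With 84 monthly observations and a 12 month lag, only 72 rows would remain available for fitting.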

Autocorrelation coefficients (ACF) can be computed
as an indication of how successive values of the same
variable relate to each other. They are also useful in
determining whether data are random or stationary (oscillate
around a constant mean), the level at which data become
stationary, data seasonality, and the length of seasonality.
Tests similar to those shown for regression analysis can be
applied to ACFs to determine characteristics. ACFs of
random data have a sampling distribution that can be
approximated by a normal curve with mean zero and standard
deviation of 1/√n. Thus, residuals are not considered random
if less than 95% of the ACFs are within 2 standard
deviations of the mean.
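
The randomness check can be sketched as follows, using the usual sample autocorrelation formula and the 2/√n limits. The function names and the synthetic 12 month sine series are assumptions made for illustration; the series merely stands in for a strongly seasonal variable such as temperature.

```python
import math


def acf(series, max_lag):
    """Sample autocorrelation coefficients for lags 1..max_lag."""
    n = len(series)
    mean = sum(series) / n
    denom = sum((y - mean) ** 2 for y in series)
    return [
        sum((series[t] - mean) * (series[t - k] - mean) for t in range(k, n)) / denom
        for k in range(1, max_lag + 1)
    ]


def share_within_limits(coefficients, n):
    """Fraction of coefficients lying within 2 standard deviations (2/sqrt(n))."""
    limit = 2 / math.sqrt(n)
    return sum(abs(r) <= limit for r in coefficients) / len(coefficients)


# Synthetic series: 84 months of a pure 12 month cycle.
series = [math.sin(2 * math.pi * t / 12) for t in range(84)]
r = acf(series, 12)

print(round(r[11], 3))                            # lag 12: strongly positive
print(round(r[5], 3))                             # lag 6: strongly negative
print(share_within_limits(r, len(series)) >= 0.95)  # False: not random
```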

The characteristic of stationarity can also be seen
by analysis of ACF. Stationarity means there is no growth
or decline in the data, i.e., data fluctuates around a
constant horizontal mean. Electricity use data will be
expected to be nonstationary, hopefully decreasing. Weather,
however, should be relatively stationary. ACF of stationary
data drops to zero after two or three time lags, whereas ACF
of nonstationary data will be significantly different from
zero for several time periods, exhibiting a trend with
increasing time lags. Removal of nonstationarity is
necessary to eliminate the effects of a trend in the ACFs
before proceeding in time series analysis. This is achieved
by a method of differencing. A new series is created by
subtracting successive values and using those differences as
a new series. The order of differencing is determined by
the number of applications required before the ACF drops to
zero after two or three time lags. Generally, real data will
not require more than first or second order differencing.
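
Differencing itself is a one-line operation, sketched below. The function name and the optional `lag` argument are illustrative assumptions; a lag of 1 gives the ordinary differencing described above, and applying the function twice gives second order differencing.

```python
def difference(series, lag=1):
    """New series formed by subtracting the value `lag` periods earlier."""
    return [series[t] - series[t - lag] for t in range(lag, len(series))]


trend = [10, 12, 15, 19, 24]      # illustrative series with accelerating growth
first = difference(trend)         # first order differences
second = difference(first)        # second order differences

print(first)
print(second)
```

Each application shortens the series by `lag` observations, which is the loss of data points noted later in connection with the effective sample size.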

Normally, a moving average refers to a continuing,
or moving, process of computing an average for a set of
observations, adding a new observation while excluding the
oldest to yield a new average. As applied in this context,
moving average indicates a process for isolating patterns in
the data that autoregression models cannot capture. Instead
of basing its forecasts on past values of a variable, the
moving average bases its forecast on a linear combination of
past estimated errors. A combination of the two methods to
make a single ARMA model is a powerful tool. However, it
took third generation computers to make ARMA models
applicable on an operational level.

Partial autocorrelations (PACF) are useful in
identification of an appropriate ARMA model. PACF is
defined as the last autoregressive term of an AR model. For
an m order, or AR(m), model, only up to m terms will be
statistically different from zero; further terms will not.

Seasonality, defined as a pattern that repeats
itself over a fixed interval, must be accounted for in time
series analysis. Weather factors such as average
temperature, heating degree days, and cooling degree days
would be seasonal. Seasonality would be detected in an
analysis of ACF by a pattern of high values emerging from
one period to the next. Weather would be expected to show a
12 month seasonal pattern. A combination of seasonality and
trend growths makes autocorrelation patterns difficult to
determine. Data must be made stationary before seasonality
can be easily determined.

Similar to regression models, time series models use
the sum of the squared errors (SS) and MSE in determining
how well a curve fits the data. Since data points are 'lost'
during differencing and time lag calculations, the effective
sample size decreases.


E. TIME SERIES MODELING TECHNIQUES


Autoregressive (AR) models were first introduced in
1926. Moving average (MA) models were later developed in
1937. ARMA models did not begin their evolution until 1938.
Since then, extensive work has been done to develop
efficient procedures and extend results to include seasonal
time series. In 1970, George Box and Gwilym Jenkins
introduced a comprehensive method of modeling univariate
time series. The Box-Jenkins method has become synonymous
with general ARMA processes applied to time series analysis,
forecasting, and control. [Ref. 11] A schematic
representation of their approach is shown in Figure 5. The
Box-Jenkins method will be the general basis of the
methodology to be presented in this section. However, it
will be modified somewhat to accommodate effective
implementation within operational command levels of DON.
Models of electricity use per square foot will be developed
for the thirteen samples with a discussion of results
highlighting important concepts and


1. Phase I: Identification





A specific ARMA model from a general class of ARMA
processes is initially selected for computation and
evaluation. Selection of a model should normally be based on
stationary data, necessitating application of differencing
methods prior to model identification. However, some
software packages, such as MINITAB, are capable of
simultaneous differencing and ARMA processing. The
methodology used herein will assume prior or simultaneous
differencing.

The general notation of a model is AR(p)I(d)MA(q),
or simply ARIMA (p d q), where p, d, q represent the orders
of autoregressive, differencing, and moving average
processes. A similar notation for a model with seasonality
considerations will be ARIMA (p d q)(P D Q) S=12, where the
upper case letters have identical meaning as the lower case
letters except for their application to seasonal orders. S
is the length of the seasonal period of time, which will be
defined as 12 months in this thesis.
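
The notation can be captured in a small record type, sketched below. The class and method names are hypothetical and are not part of MINITAB or any other package named in this thesis. The count of observations consumed by differencing follows from each ordinary difference removing one observation and each seasonal difference removing S.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ArimaOrder:
    """Orders of an ARIMA (p d q)(P D Q) S model."""
    p: int
    d: int
    q: int
    P: int = 0
    D: int = 0
    Q: int = 0
    S: int = 12  # seasonal period in months, as defined in this thesis

    def notation(self):
        """Render the orders in the notation used in the text."""
        base = f"ARIMA ({self.p} {self.d} {self.q})"
        if (self.P, self.D, self.Q) == (0, 0, 0):
            return base
        return f"{base}({self.P} {self.D} {self.Q}) S={self.S}"

    def observations_lost_to_differencing(self):
        """Each ordinary difference drops 1 point; each seasonal one drops S."""
        return self.d + self.D * self.S


model = ArimaOrder(2, 1, 2, 1, 0, 1)
print(model.notation())
print(model.observations_lost_to_differencing())
```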

There are various ways to estimate the initial
values of orders. Such methods can be time consuming,
technical, and laborious. Visual analysis, however, can also
be effective and will be applied here. Identifying the order
of the AR process can be done by examination of the PACF
plots. It will be recalled that the PACF is defined as the
last significant term of an AR model. A sharp drop from a
significant value to a nonsignificant value after p time
lags would indicate the order of the AR process. An
exponential decay from a high initial magnitude for
increasing time lags would indicate an MA process instead.
Before making a final first estimate of p, it must be
considered that prior differencing would normally have been
done. If not, the nonstationarity effects of the data may
also be included. A combination of trend and seasonality may
be difficult to separate from the observed PACF patterns.
Seasonality can be seen in the 12 month recurring pattern in
PACF. In the observation of monthly energy use of the total
study group, a trend of use can be seen after FY 79.
Differencing, usually a first order process, would be
necessary. The PACF plot shows 4 significant values before
dropping to an insignificant value. A value of p=4 is
suggested. A basic premise of the Box-Jenkins approach is
that of parsimony, or selection of the least number of
parameters. Noting the proximity of the fourth value to the
critical 2/√n point, or .214, and the possible influences of
seasonality and trend would suggest a lower number might be
used, say p=2.

An examination of the ACF plot should specify an AR
model or indicate the order of the MA process. The behavior
of ACF is just the opposite of PACF. An exponential decay
for increasing time lags indicates an AR process; a drop to
zero after q time lags indicates an MA process of the order
of significant points before a drop to zero. Figure 6
demonstrates differences in behavior of particular AR and MA
models. The ACF plot for the total study group shows 2
significant values before the high initial magnitude drops
to zero. A value of q=2 is indicated.

Using approximately the same reasoning process for
the seasonality effects, i.e., values for time lags of about
12, would indicate P=1. It is difficult to separate
seasonal trend from normal trend. An initial estimate is
D=0, assuming the observed growth is all normal. The
seasonal pattern in ACF shows an exponential decay for the
12 month lag, and therefore a seasonal AR process is
indicated, i.e., Q=0.

Figure 3.5 Autocorrelation and Partial Autocorrelation
Functions


The final tentative model then would be ARIMA (2 1
2)(1 0 0) S=12. It must be stressed that this is a
tentative model. It may or may not be accurate for a
variety of reasons, such as the existence of a noise level
lowering or raising the theoretical values of ACF or PACF,
or the unknown variation due to randomness. The
identification process is not mechanical; human judgment
plays a strong role. The final model for the study group,
based on minimum MSE with no significant values of residual
ACF, was found to be ARIMA (2 1 2)(1 0 1) S=12. The
trial-and-error approach required several iterations before
the 'best' model was determined.

As a means of providing a guideline for model
identification, the following summary of the procedure used
above is provided:

1. Review the data. Visual observations of seasonality
   and trends will assist the coming estimations.

2. Obtain a stationary series. If possible, this should
   be done next. However, if software capabilities
   allow, this can be included within step 3.

3. Examine the remaining correlations, i.e., those that
   do not drop off to zero, to determine the order of AR,
   MA, or ARMA processes. Figure 6 shows the expected
   behavior of ACF and PACF for AR(1), AR(2), MA(1), and
   MA(2) models that may aid in determination of order.

4. Identify seasonality effects. Repeat the procedures
   of steps 1-3, looking at the ACF and PACF at the
   seasonal time intervals. Intervals may not always be
   exactly 12, allowing for weather variations to
   compress or expand the actual interval.
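
The visual reading in step 3 amounts to counting how many leading coefficients stand outside the critical limit before the first drop inside it. A minimal sketch of that count follows; the function name and the coefficient values are hypothetical, chosen only to mimic the pattern described for the total study group, and the limit of .214 is the critical point quoted earlier. The count gives a starting value only, to be tempered by parsimony as discussed above.

```python
def cutoff_order(coefficients, limit):
    """Count leading coefficients whose magnitude exceeds the limit."""
    order = 0
    for r in coefficients:
        if abs(r) <= limit:
            break  # first nonsignificant lag ends the run
        order += 1
    return order


# Hypothetical PACF values for lags 1-6; the fourth barely exceeds .214.
pacf_values = [0.62, 0.41, 0.30, 0.22, 0.05, -0.03]

print(cutoff_order(pacf_values, 0.214))  # suggested starting value for p
```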
2. Phase II: Estimation and Testing


Estimation of parameters, according to the
Box-Jenkins approach, is the next step of the process. The
coefficients of the underlying mathematical equations would
be specified to give a minimum value of MSE. The parameters
would then be entered into the computer for computing
diagnostic information. Not all software packages require
parameter specification as input, however. MINITAB, for
example, requires only an ARIMA specification, such as ARIMA
(2 1 2)(1 0 1) S=12, for a seasonal model, or simply ARIMA
(2 1 2) if the model is not seasonal. Other software
packages, such as IDA, do require parameter specification.
Such specification requires a mathematical understanding of
the underlying equations of the model which is beyond the
scope of this paper, and would be difficult to apply at the
operational command level. Thus, it will be assumed here
that parameter selection is not required, or that textbooks
for the basic level of mathematical theory necessary to
estimate the parameters are available. This will enable a
more general presentation of methodology. The simplicity of
the methodology is important to any expected application
within DOD or DON. The MINITAB software package simplifies
the entire approach to model building so that a thorough
treatment of the identification process is not even entirely
necessary. Arbitrary models can be selected, entered, and
computed in a matter of seconds. The penalty for being too
arbitrary is that the user loses direction. An improper
initial model selection resulted in misdirection several
times during the development of the models for these
samples.

Diagnostic checking of the estimated model is based
on two checkpoints. The first check is to see whether the
ACF of residuals exhibits randomness, i.e., 95% of the ACF
points should lie within 2 standard deviations of the mean.
The second check is that MSE is minimal. Practicality in
applying these criteria is similar to that discussed for
regression models. The existence of one or two points
outside the confidence limits may be unacceptable in view of
the effort required to modify the model. The magnitude of
MSE may be small enough that a change in MSE by a more
complex model would be of little consequence, or the
relative ease of improving the ARIMA model for even a slight
improvement might justify additional complexity.
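
The two checkpoints can be illustrated with a short Python sketch. It rests on assumptions not stated in the text: the standard deviation of a residual autocorrelation is approximated by 1 / sqrt(n), MSE is taken as the simple mean of squared residuals (a package such as MINITAB may divide by degrees of freedom instead), and the function name `diagnostic_check` and the residuals are hypothetical.

```python
import math

def sample_acf(series, max_lag):
    """Sample autocorrelations r_1 .. r_max_lag."""
    n = len(series)
    mean = sum(series) / n
    dev = [x - mean for x in series]
    c0 = sum(d * d for d in dev)
    return [sum(dev[t] * dev[t - k] for t in range(k, n)) / c0
            for k in range(1, max_lag + 1)]

def diagnostic_check(residuals, max_lag=12):
    """Return (lags outside the limits, share of lags inside, MSE)."""
    n = len(residuals)
    limit = 2 / math.sqrt(n)  # approx. two standard deviations
    acf = sample_acf(residuals, max_lag)
    outside = [lag for lag, r in enumerate(acf, start=1) if abs(r) > limit]
    share_inside = 1 - len(outside) / max_lag
    mse = sum(e * e for e in residuals) / n
    return outside, share_inside, mse

# Hypothetical residuals that alternate in sign: clearly not random,
# so the first check should fail.
residuals = [(-1) ** t for t in range(40)]
outside, share_inside, mse = diagnostic_check(residuals)
print("lags outside limits:", outside)
print("share inside limits: %.2f" % share_inside)
print("MSE: %.3f" % mse)
```

A model would pass the first check when roughly 95% of the lags fall inside the limits; the second check compares the MSE of competing models.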

In fitting the models for the samples of this study,
three models were chosen for illustrative inclusion in Part
D of Appendices A-M. The first model attempted may or may
not have been the model indicated during the identification
phase. A second model is shown for comparative purposes as
attempts to improve the ACF pattern and MSE magnitude were
made. The final model shown was used to forecast.

NRMC Jacksonville can be used as an illustration.
An initial model was developed on observations from ACF/PACF
plots that (1) PACF dropped to 0 after 1 time lag, (2) ACF
fell to zero after 3 time lags, (3) visual survey of MBTU/SF
showed nonstationarity, (4) seasonal PACF did not drop to 0
until after two seasonal time lags, (5) ACF showed some
trend, and (6) ACF fell to 0 after 1 time lag. Thus an
initial model of ARIMA (1 1 3) (1 1 2) S=12 was attempted.
Only one ACF point was significant; an MSE of 11.518 was
determined. Simplifying the model to an ARIMA (1 1 2) (1 1
1) S=12 only decreased the significance of the first point
and increased an insignificant point to significance. MSE
increased to 12.063. At this point, either model would be
acceptable, although the second model would be preferable.
The second model suggests that a seasonal adjustment might
improve the model, since the significant points were at
t-24 and t-37. The model ARIMA (1 1 3) (1 0 2) S=12 did
improve the ACF pattern and had an MSE of 11.121. It was
accordingly determined as the 'best' model for forecasting
at that activity.

The summary statistics for the NRMC Jacksonville
ARIMA (1 1 3) (1 0 2) S=12 model display a 12 month forecast
of electricity use (FY 82). The forecast, based on the
historical patterns and trends of use at NRMC Jacksonville,
is its expected electricity use in FY 82. The fitted model
and forecasted values are shown in Part F of Appendices A-M.
The resultant curve is really not much easier to visually
interpret than the initial electricity use curves, except
that forecasted FY 82 values have been added. A superimposed
plot of actual FY 82 use and/or baseline use might afford a
useful and easy means for comparing actual FY 82 use.

Since the basic intent of energy measurement is to 
compare change from the baseline, it would be useful to 
interpret the model results in this same light. With the 
fitted model values still in the computer memory bank, it is 
fairly simple to regress the fitted values against time to 
develop a curve of trend. This provides a very clear 
interpretation of the time model affording calculations of 
percentages of change anywhere along the curve. 
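
A minimal sketch of this trend calculation follows, in Python. The fitted values are hypothetical (an exact straight line is used so the result is easy to follow), and the function names `fit_trend` and `percent_change` are illustrative only; they are not part of MINITAB or of the study.

```python
def fit_trend(values):
    """Least-squares line through (t, value) for t = 0, 1, 2, ..."""
    n = len(values)
    t_mean = (n - 1) / 2.0
    v_mean = sum(values) / n
    sxy = sum((t - t_mean) * (v - v_mean) for t, v in enumerate(values))
    sxx = sum((t - t_mean) ** 2 for t in range(n))
    slope = sxy / sxx
    intercept = v_mean - slope * t_mean
    return intercept, slope

def percent_change(intercept, slope, t_from, t_to):
    """Percent change along the trend line between two time points."""
    start = intercept + slope * t_from
    end = intercept + slope * t_to
    return 100.0 * (end - start) / start

# Hypothetical fitted model values (MBTU/SF per month), not NRMC data.
fitted = [10 + 0.5 * t for t in range(12)]
intercept, slope = fit_trend(fitted)
print("trend: %.2f + %.2f t" % (intercept, slope))
print("change from t=0 to t=10: %.1f%%" % percent_change(intercept, slope, 0, 10))
```

Passing only the later part of the fitted series to `fit_trend` gives the trend from a later origin rather than from the start of the data.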

Part E of Appendices A-M provides a statistical
summary of such a regression. The value of these statistics
is minimal relative to the normal information given by
regression. Normal interpretations of R² and MSE are
relatively meaningless in this application. But the curve
does represent the best fit of a straight line through the
fitted data points, giving the trend of forecasted use. The
trend line is shown in Part F of these Appendices.

A composite curve of actual use, fitted model, and
trend of use is shown in Part G. The fitted curve of
expected values provides a benchmark for comparison of
actual use. The May 1978 spike in energy use previously
discussed could now be more effectively evaluated. The
model curve indicates expected use. In fact, a fairly large
increase from April would have been expected. The magnitude
of the difference between expected and actual use would be a
potential target of investigation.

The FY 82 forecast has been drawn with 95%
confidence limits, calling attention to the fact that the
model curve is an expected value estimate of a 'true'
function. There will be allowances for error. A confidence
interval around the entire fitted curve of the model could
have also been plotted. However, the loss of clarity
resulting from the extra lines would not be justifiable.
The forecast interval will have control system applications
to be discussed in Chapter V.

A numerical summary of actual use, model forecasts,
and trend projections for each sample is provided as
Appendix N. At the end of each activity summary is a
comparison of particular use with the baseline. The first
column is the actual use reported and resultant change. The
FY 81 target was -12%. The second column shows the forecast
values for FY 81 and FY 82 with the percent change from FY
75. The final column is similar to column 2 but is based on
trend comparisons. For the study group, for example, a 9.3%
increase was forecasted for FY 81 by the time series model.
A 10.3% increase was actually reported. However, the trend
of use, 10.8%, might at least indicate that the rate of
growth from FY 75 had slowed. There are numerous
applications of these results, such as relative comparisons
with other activities, comparison of activities in a
particular weather zone, and comparisons by size, patient
occupancy, or facility age, that could be meaningful to an
evaluator. An extension of trend identification could be to
use different origins. For example, NRMC Orlando showed a
greater trend of increase after FY 79. Such a trend would
have been averaged out over the FY 75 - FY 82 data period.
Regression of model values against time from FY 79 would
show the trend of use from FY 79. The effects of a
particular event, such as facility expansion or
technological innovation, could then be isolated for
meaningful evaluation.

Models of electricity use by category codes using
regression techniques will be examined in this chapter. The
development of coefficients of use for various categories of
use, and their application in a modular baseline, will be
demonstrated. The average monthly electricity use and
buildings relating to that use for FY 75 for the 12 NRMC
sample will be the basis of this study.

The previously discussed problems of changing missions,
growth of electronic technology, and the corresponding
building use changes to accommodate such changes serve to
confuse, if not distort, the basis of energy reduction
comparisons. If the purpose of measuring electricity use on
a per square foot basis is only to prorate energy use for
different size bases, it would seem unreasonable then, in
light of the technological growth, to expect a 20% reduction
by FY 85. If, on the other hand, the goal of measurement is
to effect an increased efficiency of use, there must be a
means of making allowances for the changes in mission and
building uses.

Conversion of a low energy intensity warehouse for
installation of a more energy intensive aviation trainer
was discussed in Chapter II as a distortion of meaningful
energy usage comparisons. Capital improvements for
insulation, lighting, etc., may have been highly effective
in raising the energy efficiency of the warehouse. However,
the trainer equipment installation would probably dominate
use such that the total energy use would be increased. The
value of comparing the energy efficiency progress has been
lost.

In other cases, building use has been downgraded as more
modern, accommodating facilities have been constructed.
Rather than demolish an inherently energy inefficient
building, a common tactic is to use the building for a
different purpose, such as office space, warehouse/storage
space, or recreational space. The selection of these
particular uses is common inasmuch as their priority for
limited construction funds is usually lower than that of
mission essential projects. As a result, there is usually a
backlog of such projects awaiting funding. Vacated buildings
easily become a target for innovative, resourceful managers
to ease particular requirements. In fact, this may be a very
practical action with large benefits to morale. The effect
on energy use comparisons is less beneficial. The new
facility may have much greater energy efficiency. The
activity's overall use per square foot may improve. The
converted facility may also have a lower energy use per
square foot, though not necessarily as a result of any
efficiency improvements. As would be noted by TEM
advocates, energy would be wasted by the building's
inefficiency even though end use would have decreased.

Only if a building's functional use remained generally
constant during the period of comparison can any valid
conclusions be made at present. The problem is in providing
a basis of comparison when changes have occurred. A means
of effecting relatively simple changes in the baseline to
reflect the changed uses would be useful.

A modular construction of the baseline would offer such
a means of adjustment. In the case of the
warehouse-to-trainer conversion, a standard use value for
warehouses could be deducted from the baseline use, and be
replaced by a standard use value for a trainer facility.

An example of the opposite effect would be the
conversion of a World War II vintage Butler building from an
industrial shops area to a recreation center. The
conversion might have taken place as the result of a new
Public Works facility which had expanded to meet an
increased activity mission. The Butler building with its
characteristic thin, sheet metal outer walls and concrete
floors would be highly energy inefficient. The replacement
of the industrial equipment with recreational equipment
would certainly lower the energy use. Adjustment of the
baseline by replacing an FY 75 standard industrial use with
a personnel support use standard would allow a valid
comparison of the change. At present, however, the energy
baselines are fairly rigid and rarely adjusted.
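
The modular adjustment described in the two examples above can be sketched as follows, in Python. All numbers are hypothetical: the standard use values, the category names, the floor areas, and the baseline are invented for illustration and are not results of this study. The function name `adjust_baseline` is likewise an assumption.

```python
# Hypothetical standard use values in MBTU per square foot per month.
# These are placeholders, not coefficients derived in this study.
STANDARD_USE = {
    "storage": 0.03,
    "trainer": 0.20,
    "maintenance_industrial": 0.15,
    "recreation": 0.05,
}

def adjust_baseline(baseline_mbtu, area_sf, old_category, new_category,
                    standards=STANDARD_USE):
    """Deduct the old category's standard use for the converted area
    and replace it with the new category's standard use."""
    removed = standards[old_category] * area_sf
    added = standards[new_category] * area_sf
    return baseline_mbtu - removed + added

# Warehouse-to-trainer conversion: the baseline is raised.
raised = adjust_baseline(5000.0, 10000, "storage", "trainer")

# Butler building, industrial shops to recreation: the baseline is lowered.
lowered = adjust_baseline(5000.0, 10000, "maintenance_industrial", "recreation")

print("baseline after trainer conversion:    %.0f MBTU" % raised)
print("baseline after recreation conversion: %.0f MBTU" % lowered)
```

Comparing current use against the adjusted baseline, rather than the original one, keeps the percentage of change from reflecting the conversion itself.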


A. DATA CONSIDERATIONS 


The data necessary to support a development of such
standard uses is not readily available at present. The
NAVFAC P-164, Detailed Inventory of Naval Shore Activities,
was examined as a possible source of data. The P-164 lists
all the buildings at every location throughout the Naval
Shore Establishment by category code, year of construction,
floor area, dimensions, cost data, and various other details
of inventory data. Changes, such as would occur by new
construction of facilities, alterations, demolitions, or
redesignated uses, are required to be reported on a timely
basis. However, this reporting function generally has low
activity priority. Reviews are sporadic; submissions may be
several years out of date.

Activities were originally tasked with developing their
own baseline areas. Then, any changes to be made were
requested via official channels. It is therefore difficult
to reconstruct the baseline anywhere beyond the activity
level. The areas actually shown in the P-164 may be
assigned to a particular station, but their energy use may
be accountable to another command. Thus, identifying a
building's existence in FY 75 by the P-164 does not
necessarily mean that it has been included in the baseline
area of the plant property record holder. Family housing,
for example, is listed on the plant property records of the
applicable activities but energy use is reported to the
Housing Management Centers.

The functional use of a building or area within a
building is identified in the P-164 by a 5 digit use code.
Such a code is not necessarily related to its energy use.
For example, 610 as the first three digits of the category
code identifies an administrative use. 610-10 further
identifies office spaces, while 610-20 identifies data
processing centers, two very different types of energy use.
Living spaces are broken down between categories of officer
(724) and enlisted (721). Further distinction is made by
paygrade groups, such as E1-E4 (721-11) and O3 and above
(724-12). The differences in these category codes reflect
only construction criteria, although energy use
characteristics would be similar. In the NRMC sample there
were over 80 significant category codes that would be
considered as contributors to energy use. It was felt that
a grouping of category codes by energy use characteristics
vice functional use code would thus be more feasible. The
degree of refinement within the groupings would be a matter
of practicality. Subcategories of use by construction type
and/or year would be specific possibilities.

Thus, P-164 data alone is insufficient to develop
accurate area baselines because of its lack of timeliness
and lack of energy reporting detail. The P-164 is,
nevertheless, the most complete and widely available data
source and was thus used for purposes of this study. A more
refined data base would be needed to provide the accuracy
desired in an actual application.


B. METHODOLOGY


The general approach taken was to test whether average
monthly electricity use (MBTU) could be forecast as a
function of the component areas for types of functional uses
of total floor space. The derived coefficients of a
regression model used for such a forecast, being in units of
MBTU/SF, suggest that the coefficients will be an average,
or expected value, of use for the selected categories.
Interpretation of the coefficients as standard values would
then have useful applications where the energy use of
particular areas is being evaluated.

The selection of the categories of functional use would
be made relative to the information desired. It must be
recognized that the coefficients of a regression model are
relative to the applied context. The coefficients may
change as variables, i.e., categories of use, are added or
deleted, or input data is changed. Thus, the value of a
derived coefficient will depend on the input data and what
other variables are present. The implications of this
recognition limit use to a particular time frame, to a
particular activity, or to a particular group of activities,
dependent upon the context of application.

The average monthly electricity use in FY 75 was
selected as the dependent variable to enable a determination
of standards of use in the baseline year. This meant use of
only 12 data points for the total sample group. The
consequence of the small sample size was that the 12
observations allowed a maximum of only ten use categories,
leaving only 1 residual degree of freedom (12 observations
less 10 coefficients and the constant) for statistical
results.

The sample was accordingly sorted into ten major energy
use categories using the following categories and criteria:

1. Mission. Hospital space, as a large portion of the
baseline area, would be of major interest.

2. Labs. Outpatient clinics, dental clinics, as well as
designated lab areas for which specialized equipment
would be required. Full time occupancy would not be
expected.

3. Personnel Living Spaces. Unaccompanied officer/enlisted
personnel housing (UOPH/UEPH), regardless of
paygrade of occupants, and temporary living facilities.

4. Maintenance/Industrial. Shops areas, including
vehicle maintenance shops, utilizing a higher energy
use per square foot by virtue of machinery and
installed equipment.

5. Data Processing Centers.

6. Administrative. Offices, family services centers, and
other areas providing spaces for performance of
managerial, clerical and counseling functions.

7. Commercial Areas. Areas which involve commercial
sales and support such as the various functions of the
Navy Exchanges (barber/beauty shops, snack bars,
retail stores) which would entail various operating
hours and medium energy intensive equipment.

8. Morale and Welfare-Community Services. Areas which
would have relatively few hours of use by groups of
personnel, such as chapels, theaters, and child care
centers, but would not involve unusual equipment
support.

9. Morale and Welfare-Recreation. Bowling alleys,
gymnasiums, and hobby centers which would have more
regular hours of occupancy and would require some
specialized equipment support.

10. Storage. Areas in which minimal equipment
installation, low personnel occupancy, and minimum
heating/cooling standards would be maintained continually.
Subcategorization to reflect differences in temperature
standards for storage of equipment vice medical
supplies would be suggested.

The results of the 'full' 10 variable model are shown in
Table III. These results indicated that the standard use for
a hospital space was only .0003 MBTU/SF. The t-tests, in
fact, showed no significance in any of the individual
variables. It was concluded, by use of the F test, that all
coefficients were insignificant. The very high R² value of
.998 would be expected from the use of 10 variables and only
12 observations.

A stepwise regression was then attempted. It was
determined that a three variable model using hospital space,
storage, and maintenance/industrial categories would provide
significant values for the resultant coefficients. Test
results are shown in Table IV. The R² value was still
reasonable at .908.
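
The R² and F test figures for the two models can be checked from the analysis of variance portions of Tables III and IV. The Python sketch below is illustrative only. The sums of squares are as read from the tables (some digits are uncertain in the source), the 5% critical values are taken from a standard F table rather than from this study, and the function name `anova_summary` is hypothetical.

```python
def anova_summary(ss_regression, ss_residual, n_obs, n_predictors):
    """R-squared, residual degrees of freedom, and overall F ratio."""
    df_reg = n_predictors
    df_res = n_obs - n_predictors - 1  # one more is used by the constant
    ms_reg = ss_regression / df_reg
    ms_res = ss_residual / df_res
    r_squared = ss_regression / (ss_regression + ss_residual)
    return {"df_res": df_res, "r_squared": r_squared, "f": ms_reg / ms_res}

# Sums of squares as read from Tables III and IV.
full = anova_summary(236632633, 585594, n_obs=12, n_predictors=10)
reduced = anova_summary(215389442, 21829185, n_obs=12, n_predictors=3)

# 5% critical values from a standard F table (assumed, not from the study):
# F(10, 1) = 241.9 and F(3, 8) = 4.07.
F_CRITICAL = {"full": 241.9, "reduced": 4.07}

for name, result in (("full", full), ("reduced", reduced)):
    significant = result["f"] > F_CRITICAL[name]
    print("%-8s R-sq = %.3f  df = %d  F = %.1f  significant: %s"
          % (name, result["r_squared"], result["df_res"],
             result["f"], significant))
```

With only one residual degree of freedom, the full model needs a very large F ratio to be judged significant, which is why its high R² carries little weight.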


C. RESULTS 


The results of this attempt to demonstrate model use for
this sample were inconclusive overall. The full model was
unable to provide any significant coefficients. The 'best'
model, i.e., one which provided significant coefficients,
could provide values for only three coefficients. These
coefficients would only be relevant in the presence of the
other two variables. That is, the standard use for the
three categories could not be concluded to be the values
seen, with the other categories assumed to be zero. One
explanation is that the data base was too small to support a
test for 10 categories of use; a smaller number of groupings
might yield better results. However, it was felt that
further generalization of the groupings would have seriously
detracted from the desired results. The derived
coefficients would have been more general than practical.

TABLE III

Regression Results of the 'Full' Model

THE REGRESSION EQUATION IS
Y = 8990 + 0.0003 X1 + 0.0901 X2
    - 0.0356 X3 + 0.216 X4 - 1.32 X5
    - 0.144 X6 + 0.0449 X7 - 0.0025 X8
    + 0.211 X9 + 0.0298 X10

                                       ST. DEV.    T-RATIO =
      COLUMN            COEFFICIENT    OF COEF.    COEF/S.D.
      CONSTANT              8990         9690         0.93
X1    HOSPITAL            0.00031      0.04396        0.01
X2    LABS                0.09014      0.04366        2.06
X3    LIVING SPACES      -0.03563      0.04725       -0.75
X4    MAINT/INDUSTR       0.2158       0.6162         0.35
X5    DATA PROC.CTR      -1.323        2.037         -0.72
X6    ADMINISTRATIVE     -0.1441       0.2928        -0.49
X7    COMMERCIAL          0.0449       0.3709         0.12
X8    MW-COMMUNITY       -0.0025       0.3504        -0.01
X9    MW-RECREATION       0.2110       0.1116         1.89
X10   STORAGE             0.0298       0.1316         0.23

THE ST. DEV. OF Y ABOUT REGRESSION LINE IS [illegible]
WITH (12-11) = 1 DEGREES OF FREEDOM
R-SQUARED = 99.8 PERCENT

ANALYSIS OF VARIANCE
DUE TO        DF         SS         MS=SS/DF
REGRESSION    10     236632633      23663253
RESIDUAL       1        585594        585594
TOTAL         11     237218284

Although the results were inconclusive, there is reason
to believe that the proposed method merits further
consideration. The results suggest that coefficients of use
based on energy use categories could be developed by
regression techniques. The size of the data sample appears
to impose certain limits. Although this study was limited
to particular NRMCs, in an expansion of this method,
coefficients of use for the general categories of storage,
maintenance, commercial, etc., could be studied for a large
sample.

TABLE IV

Reduced Three Variable Model

THE REGRESSION EQUATION IS
Y = 2136 + 0.0360 X1 - 0.161 X2 + 0.151 X3

                                       ST. DEV.    T-RATIO =
      COLUMN            COEFFICIENT    OF COEF.    COEF/S.D.
      CONSTANT              2136         1471         1.45
X1    HOSPITAL            0.036023     0.006079       5.93
X2    [illegible]        -0.161       [illegible]  [illegible]
X3    MW-RECREATION       0.15069      0.05448        2.77

THE ST. DEV. OF Y ABOUT REGRESSION LINE IS [illegible]
WITH (12- 4) = 8 DEGREES OF FREEDOM
R-SQUARED = 90.8 PERCENT

ANALYSIS OF VARIANCE
DUE TO        DF         SS         MS=SS/DF
REGRESSION     3     215389442      71796474
RESIDUAL       8      21829185       2728646
TOTAL         11     237218666

The issue of accuracy can not be overlooked. The data
base used in this study was recognized to be susceptible to
various errors. Refinement of a data base for application
to a large sample would be challenging. However, it must
also be considered that even 'ball park' values might be of
value. 'Standards of use', with only minimal accuracy,
would still permit implementation of modular adjustments to
baselines, or to any other comparison years. The basis of
comparisons would be improved to at least some degree. It
is thus felt that the modular concept of adjustment and the
use of regression models to develop standards of use
warrant further consideration.







Energy reduction is a very significant and necessary
goal in today's environment. The cost of energy has risen
from a relatively insignificant level to one of a major
consideration. The era of cheap energy has resulted in a
multitude of energy inefficient buildings that will require
varying degrees of modifications.

The trend of rapidly rising costs will keep a strong
focus on economizing energy use. The Arab oil embargo of
1973 not only touched off an era of spiralling energy costs,
but brought about a realization of the serious consequences
of our strong dependence on oil. The vulnerability of
national security and the economy to supply and price
manipulations was clearly demonstrated by the actions of
the OPEC cartel.

Government policies and actions along with the normal
market forces of supply and demand interactions would be
expected to expand domestic oil supplies. These initiatives
have also forced corporate and private energy users to
examine the large amounts of energy being used, and wasted,
as well as to search for alternate energy sources.
Executive Order 12003 was issued to force the federal
government, the largest single energy user, to reduce its
energy use. It established the federal energy goals at a
20% reduction from FY 75 to FY 85, and a 35% reduction by FY
2000. The goals appear ominous in view of the failure to
achieve yearly target levels through FY 81. A greater level
of capital investment will be needed just to meet the 2% per
year expected rate of reduction, and an even greater rate of
investment to achieve the FY 85 overall goal. With the
limited availability of capital investment funds,
effectiveness of investment becomes a vital issue. It is an
issue which requires a full recognition of how energy is
being used, identification of areas of greatest reduction
potential, and the cost effectiveness of implementation
actions.


This thesis questioned the ability of the present
Defense Energy Information System (DEIS II) to provide the
necessary valid comparisons of energy use as a means of
identifying targets of reduction potential and evaluating
the effectiveness of corrective actions. Distortions caused
by noncontrollable factors, such as weather, and the
particular functional uses and changes in use of buildings
were shown to impair the ability of a decisionmaker to
correctly evaluate reported energy uses. Electricity use at
12 Naval Regional Medical Centers was specifically studied.
Regression and time series methodologies were demonstrated
as means of improving the effectiveness of evaluating energy
uses.

Regression models were shown to be a means of explaining
a system output by means of particular stimuli, or
predictor, variables. The techniques of selecting
variables, evaluating the appropriateness of the stimuli,
and interpreting the results were applied to weather effects
on energy use. Weather factors of average temperature,
heating degree days, cooling degree days, and precipitation
(rainfall equivalent) were specifically studied. While
weather would not be a good sole predictor of electricity
use, the relative significance of various factors could be
studied. Knowing the effects of weather on electricity use
would in turn allow the decisionmaker to assess action to
minimize such effects. Cooling degree days was seen to have
a strong relative impact on energy use. Identifying this
factor and the relative magnitude of its effects would
suggest certain actions to a decisionmaker. Load balancing,
increased insulation, and/or use restrictions might be
accordingly evaluated for implementation. A comparison of a
similar regression model after implementation of corrective
action would provide a further means of evaluation.

In an expanded model, weather variables would be only
one type of variable. Hours of occupancy, ages of the
buildings, air conditioning capacity, etc., might be other
variables of interest. The results of the expanded model
would be beneficial in further evaluation of system
interrelationships, and relative significance among selected
variables.

Time series models were shown to be a less technically
oriented method of forecasting. This type of model uses
historical patterns and trends in forecasting expected
system outputs. Plots of the four previously cited weather
factors were seen to have variations in expected levels. Of
particular concern were the higher average temperatures and
corresponding decrease in heating degree days experienced at
most of the 12 activities during the winter months of the
baseline year. Under the DEIS II system, the various
effects of differing weather conditions are also being
measured in the percentage of change calculations. Weather
would be only one of several possible influences that could
be reflected in such energy use comparisons. The time
series model, in forecasting expected values, provides a
more standardized basis of comparison.


The forecasts of expected values facilitate
identification of variations. Comparison of a current use
with an expected value, or expected value adjusted to a
reduction target, might show a large variation which should
be investigated promptly for corrective action. The present
system, with its potential of distortion, would only show a
relative change rather than a variation from expected value.
The decisionmaker can not readily assess the impact of
unusual conditions on the observed change, nor assess
whether a favorable change was in fact favorable.
Similarly, an unfavorable variation might not be identified
for a period of time due to particular conditions resulting
in a lower energy use, thereby missing an opportunity for
reduction.

Regression was further applied to a time series model to
develop a linear trend of use. This method would allow an
identification of a long term average rate of change.
Although the trend of use from FY 75 through the FY 82
projections was developed, the method is applicable over any
time period, or from any reference point.

The results of the time series forecasts and trend
projections for FY 81 are compared to actual reported uses
in Table V. It can be seen that the forecasted uses and
actual uses are relatively close. The forecasted values,
however, have been shown to be less subject to distortion
and a more reasonable basis of comparison. Comparisons
within Table V give a more appreciable picture of progress
in meeting an established target. For example, NRMC Corpus
Christi reported a 1.2% reduction from FY 75 use. Its
expected use would have shown a 0.6% decrease. Thus, the
activity showed a favorable variation from expected use of
0.6%. Comparison of reported results with a 1.0% trend of
increase shows a 2.2% favorable variation from the long term
trend of use. A similar comparison for NRMC Orlando shows a
15% unfavorable variation from expected FY 81 use, and
almost no difference from the trend of increase.
Comparisons can be made at the activity level or at a higher
echelon of command. The total group, as might be viewed by
its major claimant, showed a 1% unfavorable variation from
expected use but a 0.5% favorable variation from trend.
Since these comparisons are less influenced by unusual
variations of operations, weather, etc., they provide a more
standard basis of comparison.
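
The arithmetic behind these variations is simple enough to sketch in Python. The NRMC Corpus Christi percentages are those quoted above; the function name `favorable_variation` and the sign convention (positive means favorable) are assumptions made for illustration.

```python
def favorable_variation(reference_change_pct, reported_change_pct):
    """Reference change minus reported change, both as percent change
    from FY 75. A positive result means reported use came in below
    the reference, i.e., a favorable variation."""
    return reference_change_pct - reported_change_pct

# NRMC Corpus Christi, FY 81, percent change from FY 75 use.
reported = -1.2   # reported reduction
expected = -0.6   # time series forecast
trend = 1.0       # trend of increase

from_expected = favorable_variation(expected, reported)
from_trend = favorable_variation(trend, reported)

print("variation from expected use: %.1f%%" % from_expected)
print("variation from trend of use: %.1f%%" % from_trend)
```

A negative result under the same convention would mark an unfavorable variation, as in the NRMC Orlando comparison with expected use.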

A final regression model was attempted to explain
average monthly electricity use by the functional categories
of use for building areas. The categories of functional use
were established by identifying particular electricity use
characteristics. The coefficients of the proposed model, in
units of MBTU/KSF, can be interpreted as a standard of use
for its corresponding category variable. It was shown that
changes in a building use, or uses of areas within a
building, would alter the normal energy use per square foot,
thus distorting further comparison to the baseline year. By
means of a modular concept of the baseline, the baseline
could be adjusted by replacement of a standard use of the
former designation with a standard use value of the new
designation. The results of a ten variable (energy
category) model were found to be inconclusive by virtue of
the insignificance of the coefficients. The small size of
the data base was inadequate to support the development of
the necessary ten significant coefficients. The data base
was capable of explaining use by three significant
predictors, however. Concern for the accuracy of the data,
which was based on often outdated, inaccurate property
record submissions, was expressed. Further consideration of
the model and modular baseline concept was felt to be
justifiable, however, on the basis that even a small
improvement of the present system by use of 'ball park'
estimates would be useful.

The proposed methods of energy use models all have 
applications, either individually or in combination, in 

TABLE V 
Comparison of Forecasted and Reported Electricity Use 

conditions at that time, may not be a valid comparison for 
uses without those similar influences. Variation, if identified, 
may actually not be variation at all. If 
unidentified, the causes of unfavorable variation may go 
uncorrected for undesirable lengths of time. 

Regression models using explanatory variables such as 
connected load factors and weather would be used to forecast 
expected use. A comparison of expected use with target use 
would indicate a degree of desired change. Analysis of the 
coefficients of the model would suggest opportunities for 
achieving desired reductions. Comparison of actual use with 
forecasted expected use would identify variation that should 
be investigated for corrective action. 
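
A minimal sketch of this use of a regression model follows, assuming a single weather variable and ordinary least squares. The data, the variable names, and the helper functions are hypothetical and are not the models fitted in this study; the sign convention (positive means unfavorable) is also an assumption of the example.

```python
import numpy as np


def fit_linear(x, y):
    """Ordinary least squares fit of y = b0 + b1 * x; returns (b0, b1)."""
    design = np.column_stack([np.ones(len(y)), x])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef[0], coef[1]


def percent_variation(actual, expected):
    """Positive values mean use above expectation (unfavorable)."""
    return 100.0 * (actual - expected) / expected


# Hypothetical history: monthly cooling degree days and use in MBTU/KSF.
degree_days = np.array([0.0, 50.0, 150.0, 300.0, 400.0, 250.0])
use = 10.0 + 0.02 * degree_days  # exactly linear, for illustration only

b0, b1 = fit_linear(degree_days, use)

# Expected use for a month with 200 degree days, then compare a report.
expected = b0 + b1 * 200.0
variation = percent_variation(14.7, expected)
print(round(expected, 2), round(variation, 1))
```

The fitted coefficient `b1` is the part of the model that would be examined for reduction opportunities, while `variation` is the quantity that would trigger an investigation.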

Time series models would be of similar use in a control 
system. Targets could be developed for expected use. A 
comparison of actual use with target use would also identify 
variation for possible investigation. A particular advantage 
of the time series model is its ability to forecast use 
over a period of time. A superimposed plot of actual use 
provides an easy, visual means of display. In contrast, 
regression models dependent on observed inputs such as 
weather would be less convenient. Expected values would 
require individual calculation before a plot comparison with 
actual use could be made. The application of regression 
techniques to a time series model to develop a trend of use 
provides a further means of useful comparison. 
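
As a rough sketch of the trend of use idea, a straight line can be fitted to the modeled series against the month index and then extended forward for comparison with reported use. The series below is hypothetical, and `fit_trend` is a name introduced only for this example.

```python
import numpy as np


def fit_trend(months, modeled_use):
    """Least squares straight line through modeled use versus month."""
    slope, intercept = np.polyfit(months, modeled_use, 1)
    return intercept, slope


# Hypothetical modeled use in MBTU/KSF for months 1 through 12.
months = np.arange(1, 13, dtype=float)
modeled_use = 20.0 + 0.1 * months

intercept, slope = fit_trend(months, modeled_use)

# Extend the trend to month 24 and compare a hypothetical report against it.
trend_month_24 = intercept + slope * 24.0
reported = 23.0
variation_from_trend = 100.0 * (reported - trend_month_24) / trend_month_24
print(round(trend_month_24, 2), round(variation_from_trend, 2))
```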

Application of reduction targets to the forecasts of use 
would be a means of establishing control targets. In Table 
V, for example, a control target of a 5% reduction in 
expected FY 81 electricity use might have been established. 
Variation from that level could be used to identify cases 
for possible investigation. Similar application of targets to 
trend of use forecasts could be made. 
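
The arithmetic of such a control target is simple enough to sketch. The forecast and reported figures below are hypothetical, and the 2% investigation tolerance is an assumption added for the example; the text specifies only the 5% reduction.

```python
def control_target(expected_use, reduction_pct):
    """Target use after applying a percentage reduction to the forecast."""
    return expected_use * (1.0 - reduction_pct / 100.0)


def check_against_target(actual_use, target_use, tolerance_pct=2.0):
    """Return (flag, deviation_pct); flag is True when actual use exceeds
    the target by more than the tolerance."""
    deviation_pct = 100.0 * (actual_use - target_use) / target_use
    return deviation_pct > tolerance_pct, deviation_pct


forecast = 200.0                        # hypothetical expected use, MBTU
target = control_target(forecast, 5.0)  # 5% reduction from expected use

flag, deviation = check_against_target(199.5, target)
print(round(target, 1), round(deviation, 1), flag)
```

The same two functions would apply unchanged to a trend of use forecast in place of `forecast`.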

A regression model based on categories of use provides 
at least a potential for a control system. Comparison of 
actual use per square foot with a standard of use would be 
useful in identifying the energy efficiency of a building. 
This method, however, would require a great deal of effort 
to implement. Measurement of actual use in itself is a 
current problem with the necessity for submetering. Meters 
to provide such measurement are not in general use, at least 
not yet, within the shore establishment. However, the cost 
effectiveness of such meters has increased rapidly, and they 
should be in greater use in the near future. 

The refinement of data necessary to develop accurate 
standards of use would be a further obstacle to 
implementation of this type of model. The use of ‘ball 
park’ estimates could be of limited application in modular 
adjustments to the baseline. It is not known whether 
engineering estimates could provide more accurate or more 
cost effective standards. 

The choice of models would depend upon the decisionmak- 
er's needs and capabilities. Regression models of technical 
factors would appear to be of greatest benefit to engi- 
neering staffs such as the NAVFAC Engineering Field 
Divisions. Such models would provide the technical information 
needed to evaluate the value of particular projects. 
Their technical review function for ECIP and ETAP projects 
could be facilitated by such models. Their access to a 
broad range of data would be beneficial to development of 
the models. The variety of technical expertise readily 
available would also be significant in developing and 
applying such models. 

Time series models would appear to represent the 
greatest general value to an activity or upper echelon 
command for forecasting energy use for budget purposes as 
well as for control systems. Models could be developed for, 
or by, bases depending on the availability of computers and 
appropriate software. 

It is highly recommended that the implementation of 
energy use models be pursued within the energy management 
program of the Naval Shore establishment. Energy use models 
have been shown to be valuable tools in the measurement and 
evaluation phases of DEIS II. 

MONTHLY WEATHER SUMMARY 

E. FITTING A TREND LINE 
REGRESSION OF MODELED MBTU/KSF VS MONTH 

83 CASES USED 
13 CASES CONTAINED MISSING VALUES 

THE REGRESSION EQUATION IS: 
Y = 8.66 + 0.0291 X1 

                              ST. DEV.     T-RATIO = 
COLUMN        COEFFICIENT     OF COEF.     COEF/S.D. 
--              8.6644         0.8739         9.92 
X1  MONTH       0.02908        0.01457        2.00 

THE ST. DEV. OF Y ABOUT REGRESSION LINE IS: S = 3.179 
WITH ( 83- 2) = 81 DEGREES OF FREEDOM 

R-SQUARED = 4.7 PERCENT 

ANALYSIS OF VARIANCE 
DUE TO         DF          SS     MS=SS/DF 
REGRESSION      1       40.30        40.30 
RESIDUAL       81      818.84        10.11 
TOTAL          82      859.14 

         X1         Y        PRED. Y    ST.DEV. 
ROW     MONTH    ARIMA LN     VALUE     PRED. Y    ST.RES. 
 24      34.0     16.145      9.653      0.464      2.06R 

R DENOTES AN OBS. WITH A LARGE ST. RES. 
X DENOTES AN OBS. WHOSE X VALUE GIVES IT LARGE INFLUENCE. 

DURBIN-WATSON STATISTIC = 0.42 
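
The summary figures in a printout of this kind can be cross-checked from its own analysis of variance. The short Python sketch below does so for the trend line above, taking the residual sum of squares as 818.84, the value consistent with the printed total (40.30 + 818.84 = 859.14). The variable names are introduced here for illustration only.

```python
import math

# Figures read from the trend line printout above.
ss_regression = 40.30
ss_residual = 818.84
ss_total = 859.14
df_residual = 81
slope, slope_sd = 0.02908, 0.01457

r_squared_pct = 100.0 * ss_regression / ss_total  # share of variation explained
s = math.sqrt(ss_residual / df_residual)          # st. dev. of Y about the line
t_ratio = slope / slope_sd                        # coefficient / its st. dev.

print(round(r_squared_pct, 1), round(s, 3), round(t_ratio, 2))
```

The results agree with the printed values of 4.7 percent, 3.179, and 2.00 to within rounding, which is a useful check when a printout is difficult to read.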

E. FITTING A TREND LINE 
REGRESSION OF MODELED MBTU/KSF VS MONTH 

83 CASES USED 
13 CASES CONTAINED MISSING VALUES 

THE REGRESSION EQUATION FOR NRMC CHARLESTON IS: 

                              ST. DEV.     T-RATIO = 
COLUMN        COEFFICIENT     OF COEF.     COEF/S.D. 
--             21.8511         0.8327        Zor 
X1  MONTH       0.01089        0.01388        0. 

THE ST. DEV. OF Y ABOUT REGRESSION LINE IS: S = 3. 
WITH ( 83- 2) = 81 DEGREES OF FREEDOM 

R-SQUARED = 0.8 PERCENT 

ANALYSIS OF VARIANCE 
DUE TO         DF          SS     MS=SS/DF 
REGRESSION      1       5.653        5.653 
RESIDUAL       81     743.595        9.180 
TOTAL          82     749.248 

         X1         Y        PRED. Y    ST.DEV. 
ROW     MONTH    ARIMA LN     VALUE     PRED. Y 
 23      23.0     28.983     22.102      0.555 
 71      71.0     20 2) >    22.624      0.400 
 83      83.0     28.880     22.755      0.572 

R ==> OBS. WITH A LARGE ST. RES. 
X ==> OBS. WHOSE X VALUE GIVES IT LARGE INFLUENCE. 

DURBIN-WATSON STATISTIC = 0.6) 

REGRESSION OF MBTU/KSF VS WEATHER VARIABLES 

                              ST. DEV.     T-RATIO = 
COLUMN        COEFFICIENT     OF COEF.     COEF/S.D. 
X1  MONTH      -0.05550        0.01019       -5.45 

THE ST. DEV. OF Y ABOUT REGRESSION LINE IS: S = 2.722 
WITH ( 95- 2) = 93 DEGREES OF FREEDOM 

R-SQUARED = 24.2 PERCENT 

ANALYSIS OF VARIANCE 
DUE TO         DF          SS     MS=SS/DF 
REGRESSION      1     220.544      220.544 
RESIDUAL       93     689.306        7.412 
TOTAL          94     909.850 

         X1         Y        PRED. Y    ST.DEV. 
ROW      MO.     ARIMA LN     VALUE     PRED. Y    ST.RES. 
  5               20.3395    25.749      0.528     -2.00R 
 34      34       20-3 30    24.138      0.318      2.00R 
 44      44       29.130     23.583      0.284      22208 
 68      68       16.698     22.249      0.340      Zao 8 

R DENOTES AN OBS. WITH A LARGE ST. RES. 
X DENOTES AN OBS. WHOSE X VALUE GIVES IT LARGE INFLUENCE. 

E. FITTING A TREND LINE 
REGRESSION OF MODELED MBTU/KSF VS MONTH 

83 CASES USED 
13 CASES CONTAINED MISSING VALUES 

THE REGRESSION EQUATION FOR NRMC MEMPHIS IS: 
Y = 24.4 + 0.102 X1 

                              ST. DEV.     T-RATIO = 
COLUMN        COEFFICIENT     OF COEF.     COEF/S.D. 
--             24.443          1.020         23.95 
X1  MONTH       0.10152        0.01701        5.97 

THE ST. DEV. OF Y ABOUT REGRESSION LINE IS: S = 3.713 
WITH ( 83- 2) = 81 DEGREES OF FREEDOM 

R-SQUARED = 30.5 PERCENT 

ANALYSIS OF VARIANCE 
DUE TO         DF          SS     MS=SS/DF 
REGRESSION      1      491.07       491.07 
RESIDUAL       81     1116.64        13.79 
TOTAL          82     1607.72 

         X1         Y        PRED. Y    ST.DEV. 
ROW     MONTH    ARIMA LN     VALUE     PRED. Y    ST.RES. 
 34      34.0     36.421     27.895      0.542      2.32R 
 35      35.0     38.941     27.997      0.531      2.98R 
 62      62.0     22.53      30.738      0.425     -2.33R 

R ==> OBS. WITH A LARGE ST. RES. 
X ==> OBS. WHOSE X VALUE GIVES IT LARGE INFLUENCE. 

DURBIN-WATSON STATISTIC = 0.46 
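
The Durbin-Watson statistic printed with these trend lines is the sum of squared differences between successive residuals divided by the sum of squared residuals. Values near 2 suggest no first-order serial correlation, and values well below 2 suggest positive serial correlation. The sketch below is a minimal illustration; the residual series are hypothetical and the function name is introduced only for this example.

```python
import numpy as np


def durbin_watson(residuals):
    """Sum of squared successive differences over sum of squared residuals."""
    e = np.asarray(residuals, dtype=float)
    return float(np.sum(np.diff(e) ** 2) / np.sum(e ** 2))


# Residuals that alternate in sign give a statistic above 2.
alternating = durbin_watson([1.0, -1.0, 1.0, -1.0])

# Residuals that stay on one side of the line for long runs give a
# statistic well below 2, the pattern typical of a seasonal series
# fitted with a straight line.
persistent = durbin_watson([1.0, 1.0, 1.0, -1.0, -1.0, -1.0])

print(alternating, round(persistent, 3))
```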

E. FITTING A TREND LINE 
REGRESSION OF MODELED MBTU/KSF VS MONTH 

83 CASES USED 
13 CASES CONTAINED MISSING VALUES 

THE REGRESSION EQUATION IS: 
Y = 17.6 + 0.0322 X1 

                              ST. DEV.     T-RATIO = 
COLUMN        COEFFICIENT     OF COEF.     COEF/S.D. 
--             V7 255 7        0.3465        50.70 
X1  MONTH       0.032152       0.005776       5.57 

THE ST. DEV. OF Y ABOUT REGRESSION LINE IS: S = 1.261 
WITH ( 83- 2) = 81 DEGREES OF FREEDOM 

R-SQUARED = 27.7 PERCENT 

ANALYSIS OF VARIANCE 
DUE TO         DF          SS     MS=SS/DF 
REGRESSION      1      49.250       49.250 
RESIDUAL       81     128.740        1.589 
TOTAL          82     177.990 

         X1         Y        PRED. Y    ST.DEV. 
ROW     MONTH    ARIMA LN     VALUE     PRED. Y    ST.RES. 
 62      62.0     20 ys)     Bers),      0.144     P63 (8) 8! 
 Le)     Te te    30770      Ls Bie 1S   0.180     = Zo 

R ==> OBS. WITH A LARGE ST. RES. 
X ==> OBS. WHOSE X VALUE GIVES IT LARGE INFLUENCE. 

DURBIN-WATSON STATISTIC = 1.63 

E. FITTING A TREND LINE 
REGRESSION OF MODELED MBTU/KSF VS MONTH 

95 CASES USED 
1 CASES CONTAINED MISSING VALUES 

THE REGRESSION EQUATION FOR NRMC ORLANDO IS: 
Y = 17.8 + 0.209 X1 

                              ST. DEV.     T-RATIO = 
COLUMN        COEFFICIENT     OF COEF.     COEF/S.D. 
--             17.762          Ve los        15. 
X1  MONTH       0.20883        0.02080       10. 

THE ST. DEV. OF Y ABOUT REGRESSION LINE IS: S = 5. 
WITH ( 95- 2) = 93 DEGREES OF FREEDOM 

R-SQUARED = 52.0 PERCENT 

ANALYSIS OF VARIANCE 
DUE TO         DF          SS     MS=SS/DF 
REGRESSION      1     3115.43      3115.43 
RESIDUAL       93     2874.33        30.91 
TOTAL          94     5989.76 

         X1         Y        PRED. Y    ST.DEV. 
ROW     MONTH    ARIMA LN     VALUE     PRED. Y 
 54      54.0     Weer? 69   29.039      We oo 0 
 65      65.0     Voc 3 350              0.660 
 66      66.0     17.754     31.544      0.671 

R ==> OBS. WITH A LARGE ST. RES. 
X ==> OBS. WHOSE X VALUE GIVES IT LARGE INFLUENCE. 

E. FITTING A TREND LINE 
REGRESSION OF MODELED MBTU/KSF VS MONTH 

95 CASES USED 
CASES CONTAINED MISSING VALUES 

THE REGRESSION EQUATION IS 
Y = 15.7 + 0.0260 X1 

                              ST. DEV. 
COLUMN        COEFFICIENT     OF COEF. 
--             15.7066         0.4348 
X1  MONTH       0.026022       0.007743 

THE ST. DEV. OF Y ABOUT REGRESSION LINE IS 
WITH ( 95- 2) = 93 DEGREES OF FREEDOM 

R-SQUARED = 10.8 PERCENT 

ANALYSIS OF VARIANCE 
DUE TO         DF          SS     MS=SS/DF 
REGRESSION      1      48.377       48.377 
RESIDUAL       93     398.363        4.283 
TOTAL          94     446.740 

DURBIN-WATSON STATISTIC = 0.33 

A. CUMULATIVE PROBABILITIES OF STANDARD NORMAL DISTRIBUTION 

Entry is area 1 - α under the standard normal curve from -∞ to z(1 - α) 

 z     .00    .01    .02    .03    .04    .05    .06    .07    .08    .09 
0.0 | .5000  .5040  .5080  .5120  .5160  .5199  .5239  .5279  .5319  .5359 
0.1 | .5398  .5438  .5478  .5517  .5557  .5596  .5636  .5675  .5714  .5753 
0.2 | .5793  .5832  .5871  .5910  .5948  .5987  .6026  .6064  .6103  .6141 
0.3 | .6179  .6217  .6255  .6293  .6331  .6368  .6406  .6443  .6480  .6517 
0.4 | .6554  .6591  .6628  .6664  .6700  .6736  .6772  .6808  .6844  .6879 
0.5 | .6915  .6950  .6985  .7019  .7054  .7088  .7123  .7157  .7190  .7224 
0.6 | .7257  .7291  .7324  .7357  .7389  .7422  .7454  .7486  .7517  .7549 
0.7 | .7580  .7611  .7642  .7673  .7704  .7734  .7764  .7794  .7823  .7852 
0.8 | .7881  .7910  .7939  .7967  .7995  .8023  .8051  .8078  .8106  .8133 
0.9 | .8159  .8186  .8212  .8238  .8264  .8289  .8315  .8340  .8365  .8389 
1.0 | .8413  .8438  .8461  .8485  .8508  .8531  .8554  .8577  .8599  .8621 
1.1 | .8643  .8665  .8686  .8708  .8729  .8749  .8770  .8790  .8810  .8830 
1.2 | .8849  .8869  .8888  .8907  .8925  .8944  .8962  .8980  .8997  .9015 
1.3 | .9032  .9049  .9066  .9082  .9099  .9115  .9131  .9147  .9162  .9177 
1.4 | .9192  .9207  .9222  .9236  .9251  .9265  .9279  .9292  .9306  .9319 
1.5 | .9332  .9345  .9357  .9370  .9382  .9394  .9406  .9418  .9429  .9441 
1.6 | .9452  .9463  .9474  .9484  .9495  .9505  .9515  .9525  .9535  .9545 
1.7 | .9554  .9564  .9573  .9582  .9591  .9599  .9608  .9616  .9625  .9633 
1.8 | .9641  .9649  .9656  .9664  .9671  .9678  .9686  .9693  .9699  .9706 
1.9 | .9713  .9719  .9726  .9732  .9738  .9744  .9750  .9756  .9761  .9767 
2.0 | .9772  .9778  .9783  .9788  .9793  .9798  .9803  .9808  .9812  .9817 
2.1 | .9821  .9826  .9830  .9834  .9838  .9842  .9846  .9850  .9854  .9857 
2.2 | .9861  .9864  .9868  .9871  .9875  .9878  .9881  .9884  .9887  .9890 
2.3 | .9893  .9896  .9898  .9901  .9904  .9906  .9909  .9911  .9913  .9916 
2.4 | .9918  .9920  .9922  .9925  .9927  .9929  .9931  .9932  .9934  .9936 
2.5 | .9938  .9940  .9941  .9943  .9945  .9946  .9948  .9949  .9951  .9952 
2.6 | .9953  .9955  .9956  .9957  .9959  .9960  .9961  .9962  .9963  .9964 
2.7 | .9965  .9966  .9967  .9968  .9969  .9970  .9971  .9972  .9973  .9974 
2.8 | .9974  .9975  .9976  .9977  .9977  .9978  .9979  .9979  .9980  .9981 
2.9 | .9981  .9982  .9982  .9983  .9984  .9984  .9985  .9985  .9986  .9986 
3.0 | .9987  .9987  .9987  .9988  .9988  .9989  .9989  .9989  .9990  .9990 
3.1 | .9990  .9991  .9991  .9991  .9992  .9992  .9992  .9992  .9993  .9993 
3.2 | .9993  .9993  .9994  .9994  .9994  .9994  .9994  .9995  .9995  .9995 
3.3 | .9995  .9995  .9995  .9996  .9996  .9996  .9996  .9996  .9996  .9997 
3.4 | .9997  .9997  .9997  .9997  .9997  .9997  .9997  .9997  .9997  .9998 

Source: Neter, John and Wasserman, William, Applied Linear 
Statistical Models, Irwin, 1974 





B. PERCENTILES OF THE t-DISTRIBUTION 

Entry is t(1 - α; ν) where P{t(ν) ≤ t(1 - α; ν)} = 1 - α 

                            1 - α 
  ν      .55     .60     .65     .70     .75     .80     .85 
  1 |  0.158   0.325   0.510   0.727   1.000   1.376   1.963 
  2 |  0.142   0.289   0.445   0.617   0.816   1.061   1.386 
  3 |  0.137   0.277   0.424   0.584   0.765   0.978   1.250 
  4 |  0.134   0.271   0.414   0.569   0.741   0.941   1.190 
  5 |  0.132   0.267   0.408   0.559   0.727   0.920   1.156 

  6 |  0.131   0.265   0.404   0.553   0.718   0.906   1.134 
  7 |  0.130   0.263   0.402   0.549   0.711   0.896   1.119 
  8 |  0.130   0.262   0.399   0.546   0.706   0.889   1.108 
  9 |  0.129   0.261   0.398   0.543   0.703   0.883   1.100 
 10 |  0.129   0.260   0.397   0.542   0.700   0.879   1.093 

 11 |  0.129   0.260   0.396   0.540   0.697   0.876   1.088 
 12 |  0.128   0.259   0.395   0.539   0.695   0.873   1.083 
 13 |  0.128   0.259   0.394   0.538   0.694   0.870   1.079 
 14 |  0.128   0.258   0.393   0.537   0.692   0.868   1.076 
 15 |  0.128   0.258   0.393   0.536   0.691   0.866   1.074 

 16 |  0.128   0.258   0.392   0.535   0.690   0.865   1.071 
 17 |  0.128   0.257   0.392   0.534   0.689   0.863   1.069 
 18 |  0.127   0.257   0.392   0.534   0.688   0.862   1.067 
 19 |  0.127   0.257   0.391   0.533   0.688   0.861   1.066 
 20 |  0.127   0.257   0.391   0.533   0.687   0.860   1.064 

 21 |  0.127   0.257   0.391   0.532   0.686   0.859   1.063 
 22 |  0.127   0.256   0.390   0.532   0.686   0.858   1.061 
 23 |  0.127   0.256   0.390   0.532   0.685   0.858   1.060 
 24 |  0.127   0.256   0.390   0.531   0.685   0.857   1.059 
 25 |  0.127   0.256   0.390   0.531   0.684   0.856   1.058 

 26 |  0.127   0.256   0.390   0.531   0.684   0.856   1.058 
 27 |  0.127   0.256   0.389   0.531   0.684   0.855   1.057 
 28 |  0.127   0.256   0.389   0.530   0.683   0.855   1.056 
 29 |  0.127   0.256   0.389   0.530   0.683   0.854   1.055 
 30 |  0.127   0.256   0.389   0.530   0.683   0.854   1.055 

 40 |  0.126   0.255   0.388   0.529   0.681   0.851   1.050 
 60 |  0.126   0.254   0.387   0.527   0.679   0.848   1.046 
120 |  0.126   0.254   0.386   0.526   0.677   0.845   1.041 
  ∞ |  0.126   0.253   0.385   0.524   0.674   0.842   1.036 

Source: Neter, John and Wasserman, William, Applied Linear 
Statistical Models, Irwin, 1974 


C. PERCENTILES OF THE F-DISTRIBUTION 
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178 


-Sso™ 


216 
864 
5,403 


21,615 22,500 


$40,380 562,500 57 


V1 

4 5 
1.82 1.89 
55.8 S72 
225 230 
900 922 
5,625 5,764 
23,056 
6,400 
1.2] 1.25 
9.24 9.29 
19.2 19.3 
39.2 39.3 
99.2 99,3 
199 199 
999.2 999.3 
1.06 1.10 
5.34 5.31 
9.12 9.01 
15.1 14.9 
28.7 232 
46.2 45.4 
137.1 134.6 
1.00 1.04 
4.11 4.05 
6.39 6.26 
9.60 9.36 
16.0 [335 
Pe 4 Pe) 
53.4 51.7 
0.965 1.00 
3252 3.45 
Salo 5.05 
7.39 7S 
11.4 11.0 
15.6 14.9 
a1. 295 
0.942 0.977 
3.18 3.11 
4.53 4.39 
6.23 5.99 
9.15 8.75 
12.0 es 
21.9 20.8 
0.926 0.960 
2.96 2.88 
4.12 3.97 
S152 5.29 
7.85 7.46 
10.1 9°52 
b72 16.2 


Zot 


6 


1.94 
58.2 
234 

537 
5,859 
23,437 
585,940 


7 


1.98 
$8.9 
PAT | 

948 
5,925 
23,715 
$92,870 


8 


2.00 
59.4 
239 

957 
5,981 
Pan ye Pe) 
598,140 


I 32 
93u 





PERCENTILES OF THE F DISTRIBUTION 


Vi 
vy, |l—«@ 10 12 15 20 24 ; 30 60 120 CO 
1 .50 2.04 2.07 2.09 Ada | eas 213 2.17 2.18 2.20 
.90 60.2 60.7 61.2 61.7 62.0 62.3 62.8 63.1 63.3 
95 242 pi 246 248 249 250 252 250 254 
975 969 977 985 993 997 1,001 1,010 1,014 1,018 
.99 6,056 6,106 6,157 6,209 6,235 6,261 6,313 6,339 6,366 


995 | 24,224 24,426 24,630 24,836 24,940 25,044 25,253 25,359 25,464 
.999 | 605,620 610,670 615,760 620,910 623,500 626,100 631,340 633,970 636,620 


2 .50 1.34 1.36 1.3 1.39 1.40 1.41 1.43 1.43 1.44 
90 9.39 9.4] 9.42 9.44 9.45 9.46 9.47 9.48 9.49 
95 19.4 19.4 19.4 19.4 19.5 19.5 19.5 19.5 19.5 
975 39.4 39.4 39.4 39.4 a9) ee Fe 39:5 525 39.5 
ao 99.4 99.4 99.4 99.4 29.5 99:5 99.5 99.5 99.5 
ee 199 199 199 199 199 199 199 199 200 
999 999.4 999.4 999.4 999.4 999.5 999.5 999.5 999.5 999.5 

3 350 1.18 1.20 1.2! 1.23 1.23 1.24 1.25 1.26 7 
.90 5.23 5.20 5520 5.18 5-13 5.17 S15 5.14 5.13 
35 8.79 8.74 8.70 3.66 8.64 8.62 8.57 6.55 8.53 
75 14.4 14.3 14.3 Ine 14.1 14.1 14.0 13.9 13.9 
99 pA ips 27.1 26:9 26.7 26.6 26.5 26.3 26.2 26:1 
995 43.7 43.4 43.] 42.3 42.6 42.5 42.1 42.0 41.8 
eke 20:2 928.3 127.4 1264 125.9 125.4 124.5 1240 123.5 

+ 30 1.11 bs 1.14 1.15 1.16 1.16 1.18 1.18 1.19 
.90 592 3.90 3.87 3.84 5285 3:82 Bre! 3.78 3.76 
J 5.96 5,91 5.86 5.80 BTL! Sap, 5.69 5.66 5.63 
975 8.84 8.75 8.66 8.56 8.51 8.46 8.36 8.31 8.26 
9 14.5 14.4 14.2 14.0 [3.9 13.8 ae, 13.6 13:5 
995 eV 20.7 20.4 a) 2 20.0 19.9 19.6 19.5 19.3 
.999 48.1 47.4 46.8 46.1 45.8 45.4 44.7 44.4 44.1 

5. 1.07 1.09 1.10 1.11 1.12 1.12 1.14 1.14 ie 
90 3.30 a7 3.24 Be) 3.19 Sle) 3.14 8.12 Sell 
2 4.74 4.68 4.62 4.56 4.53 4.50 4.43 4.40 4.37 
75 6.62 6o2 6.43 6.33 6.28 6:23 6.12 6.07 6.02 
99 10.1 9.89 9.72 a3) 9.47 9.38 9.20 al 2.02 
995 13.6 13.4 13.1 [229 12.8 12.7 12.4 12.3 p221 
.999 26.9 26.4 2959 25.4 29.1 24.9 24.3 24.1 23.8 

6 .50 1.05 1.06 1.07 1.08 1.09 1.10 1.11 le V2 
90 2.94 eae, Y) 2537 2.84 2.82 2280 2.76 2.74 P47 
BS 4.06 4.00 3.94 3:87 3.84 3.81 3.74 3.70 3.67 
5 5.46 5.37 doc! 5.17 S12 5.07 4.96 4,90 4.85 
99 7.37 Fede 7.56 7.40 7.31 oee5 7.06 6.97 6.88 
995 10,2 10.0 9.31 a9 9.47 9.36 9:12 9.00 8.38 
999 18.4 18.0 17.6 Wi .1 16.9 16.7 16.2 16.0 15.7 

i .a0 1.03 1.04 1.05 1.07 1.0¢e~ 1,08 1.09 1.10 1.10 
.90 ae 2.67 Z.03 2.59 2.58 2,50 Aa | 2.49 2.47 
35 3.64 3.57 p51 3.44 3.41 3.38 3.30 B.2/ q.25 
715 4.76 4.67 4.57 4.47 4.42 4.36 4.2 4.20 4.14 
99 6.62 6.47 6.31 6.16 6.07 5.99 Bes 5.74 5.65 
Ebb 8.38 8.18 7.97 Tis 7.65 P52 Tl TAS 7.08 
999 14.1 13.7 I. 12.9 b2.7 12.5 12:1 11.9 11.7 


298 








PERCENTILES OF THE F-DISTRIBUTION 


0.860 0.915 
2.31 
3.84 
5.05 
7.01 
8.31 








PERCENTILES OF THE F DISTRIBUTION 





10 .50 1.00 1.01 1.02 1.03 1.04 1.05 1.06 1.06 1.07 
90 2.52 2.28 2.24 2220 2.18 2.16 2.11 2.08 2.06 
ae) 2:98 Z.2 1 2.34 P| 2.74 2.40 2°02 2.58 2.54 
975 Sore 3.62 deo2 3.42 o37 Jeol 3.20 3.14 3.08 
see: 4.35 : 


90 a | 2215 2.10 2.06 2.04 2.01 1.96 93 1.90 
95 Pag fe) 2.69 2,02 ze4 51 2.47 208 2.34 2.30 
975 3:57 3.28 3.18 3.07 3.02 2170 2.85 219 2.72 
99 4.30 4.16 4.01 3.86 3.78 3.70 3.54 3.45 3.36 
995 5.09 4.91 4,72 4.53 4.43 4,33 4.12 4.01 3.90 
999 Ta? 7.00 6.71 6.40 6.25 6.09 216 5,59 5.42 
15.50 0.977 0.989 1.00 1.01 1.02 1.02 1.03 1.04 1.05 
90 2:06 2.02 1:97 1,92 1.90 1.87 1.82 1.79 1.76 
95 2.54 2.48 2.40 233 no 2 2ah0 ae 2:07 
975 3.06 2.96 2.86 2.76 2.70 2.64 252 2.46 2.40 
99 3.80 3.67 Je Jy 3:29 Saal 3.05 2.96 287 
995 4.42 4.25 4.07 3.88 3.79 3.69 3.48 3.97 3:26 
999 6.08 5.81 5.54 5:25 5.10 4.95 4.64 4.48 4.31 
20 = =~.50 0.966 0.977 0.989 1.00 1.01 1.01 1.02 1.03 1.03 
90 1.94 1.89 1.34 1.79 lai 1.74 1.68 1.64 1.61 
95 205 2.26 2.20 22 2.08 2.04 1.95 1.90 1.84 
975 Rane 4 2.68 2.50 2.46 2.41 2.9) Dig pve 2.16 2.09 
99 338 3.25 3.09 2.94 2.86 2.78 2.61 2.52 2.42 
995 3.85 3.68 3.50 3.02 gz S12 2.92 2.81 2:69 
999 5.08 4.82 4.56 4:29 4.15 4.00 3.70 3.54 38 
24 «50 0.961 0.972 0.983 0.994 +.06 «- 1.01 1.02 1.02 1.03 
90 1.88 1.83 1.78 1.73 1.70 1.67 1.61 157 Ino 
95 pas 45) 2.18 2.11 2.05 1.98 1.94 1.84 1.79 as 
975 2.64 2.54 2.44 2.33 Pay | B74) 2.08 2.01 1.94 
99 cml, 3.03 2.89 2.74 2.66 2.58 2.40 2.31 page| 
995 3:59 3.42 Be 3.06 2.97 2.87 2.66 2.09 2.43 
S39 4.64 4.39 4.14 3.87 3.74 3.99 8:29 3.14 2297 
PERCENTILES OF THE F DISTRIBUTION 


V> i—2 


30 


120 


10.8 


2 


0.709 


-s 3eu 


0.807 
228 
2.92 
3.59 
4.51 
5.24 
7.05 
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Sole 


0.901 


4.37 


0.896 


0.891 


g 9 
0.939 0.948 
1.88 1.85 
gee 22 
2.65 2S) 
Rag G8 07 
3.58 3.45 
4.58 4.39 
0.928 0.937 
77) wie 
2.10 2.04 
241 = 2.33 
D8) 2.2 
3.13 3.01 
3.86 3.69 
0.923 0.932 
1.72 1.68 
2.02 1.96 
230 Lice 
266 ~=—-22.56 
293 = 2.81 
3.55 3.38 
0.918 0.927 
1.67 1.63 
1.94 1.88 
2.19 2.11 
2.51 2.41 
274 2.62 
3.27 3.10 





PERCENTILES OF THE F DISTRIBUTION 





Degrees of 
Freedom for Degrees of Freedom for Numerator (df; ) 
Denom:na- 
cee etele ( [e [ae juote [ede [» [ To To To Ts fo be 
25 4241338 | 299/276 )249 | 241 | 234/228] 224}220}216 4211 
777|557| 468| 478 ca 363 |346| 332! 321| 313| 3051299) 289 
25 422|337| 289] 274}2591247| 239] 232] 227) 222] 218] 215] 210 
772|553| 464| 4141 3821359| 342| 329| 317) 309| 302} 296! 286 
27 321 | 335] 296 | 2.73 | 257 | 246 | 237] 230] 225) 2.20] 216] 213 | 208 
768 | 549| 460| 4791379) 356!339} 326) 314| 306| 298) 293} 283 
28 320/334} 295/271 |256| 244] 236] 229] 224] 219] 215] 212] 206 
764) 5.45| 457 | 407) 376) 353| 336| 323| 311! 303| 295} 290| 280 
29 4181333| 293] 270} 254| 243/235] 228| 222/ 2181214] 210] 205 
760 | 5.52| 454] 404) 373} 3.50| 333] 320| 308| 300} 292| 287 | 277 
30 $17] 332{ 292] 269/253 | 242] 234} 227/221] 216] 212] 209] 204 
756} 5399| 451 | 402|370)347| 330| 317} 306| 298| 200| 284} 274 
32 415}330]{ 290] 267 | 251 | 240] 232] 225) 219; 214] 210 | 207 | 202 
750 | 534) 446 | 397 | 366 |342|325| 312) 301| 294 ai 270 
34 413/328} 298 | 265/249 | 238{ 230] 223] 217] 212] 208} 205 { 200 
744) 529| 442} 393 | 361 | 338) 32! ees 2 89 2821276 266 
26 4°11 326| 286] 263 | 248 | 236] 228! 221} 215| 210) 206 | 203 | 189 
7391525 | £38|389)358'335}318| 304} 294) 286| 278 | 272 | 262 
38 410} 325| 2851262]246 | 235| 226} 219| 214] 209] 205] 202/196 
7395 |5211 $34) 38613541332) 375| 302) 291) 282) 275 | 269 | 259 
40 408 | 323| 284] 261 | 245 | 234) 225] 218] 212] 207] 204} 200/195 
7311518) 4371383135113291 312) 299| 288| 280| 273 | 266 | 256 
52 907 | 322} 283] 259) 2444232] 224, 217/211] 206] 202 | 199/194 
727 1515| 4291 380|349|326|3101296| 286| 277| 270) 264 | 254 
44 406 | 321 | 2821 258 | 243 | 231 | 223] 216 210] 205} 201 | 198 | 1 92 
7241512) 426 | 378|346 |324| 307) 294| 284| 275) 268 | 252 | 252 
46 405|320| 281 | 257} 242 |230| 222] 214{ 2.09] 204] 200|/ 197/191 
721|510| 424) 3761344 |322| 305| 2.92} 282| 273! 266 | 260 | 2.50 
Source: Neter, John and Wasserman, William, Applied Linear 
Statistical Models, Irwin, 1974 

D. DURBIN-WATSON TEST BOUNDS 





Forna= 3 Ke4, 
95% aontidenca interval 






4-O.m, © 4-121¢279 
4-O0-W, 04-156 2246 





5 Percent Significance Points of dL and dU (two tailed test) 














= kel k=2 k=3 koa ke5 
: ee 
A C; o, ¢, d, a, oa, re a, ¢, 
23 18 95 1.23 83 1.40 71 1.61 59 1.84 48 
16 .98 1.24 86 1 40 .7§ 1.59 §4 180 $3 
17 101 125 $0 =—«*11: 40 79 1 58 68 1.77 57 
18 1.03 126 93 149 82 156 72 1.74 62 
19 106 1.28 96 1 41 86 155 76 173 é6 
20 108 1.28 99 141 89 61.55 79 172 70 
21 110 130 3101 #141 921.54 83 1.69 13 
= 22 1.72 1.31 104 1,42 95 1.54 386 «168 77 
a 23 114 132 106 142 97 154 89 167 80 
24 1.16 133 108 143 #100 ~= «1.84 91 166 83 
25 1.18 1.34 1.10 1.43. 1.02 1.58 94 165 86 
26 119 135 112 #144 #104 #14564 96 165 §8 
27 1.21 1.36 113 144 #106 154 99 164 gt 
28 1.22 1.37 115 145 #4108 #£=1454 101 1.84 93 
23 1.24 138 1.17 145 1.10 1.54 103 1 83 96 
30 125 138 118 $146 +1312 +154 +105 163 98 
1 126 1.39 120 $147 #113 #155 4107 163 10 
= 32 1.27 140 121 #=147~ «1.16 #=155 1.08 1.83 1.02 
33 128 1.41 #2©1.22 «1.48 «21.116 #2155 110 163 ~~ «12.04 
< 4 1.29 141 #126 #=%148 #=$1.17 155 1.12 163 106 
3$ 130 1.42 #125 #168 #=+%1.:119 155 1.13 163 107 
36 1.31 #143 #126 #149 «=+1.20 «0156 +3115 163 109 
3? 1.32 143 #127 «+49 ~=§ 1.21 156 1.16 162 1.10 
7 38 1.33 1446 #128 #+%150 1.23 156 1.17 1.62 1.12 
39 1.34 144 129 1.50 1.24 1.56 1.19 163 1.13 
4 t.35 145 130 1.51 1.25 1.57 1.20 163 1.15 
45 139 148 134 #153 #130 158 125 #%163~— «212.21 
50 142 #150 1.38 $1154 #134 #+159 1:30 164 126 
55 1.45 152 1.41 1.56 1.37 160 1.33 1.64 1.20 
60 147 154 148 157 140 161 137 ~..1.65 133 
65 149 #155 146 1.659 143 163 1.40 166 = 136 
2 70 1.51 1.57 148 160 1.45 163° 1.42 166 1.39 
78 153 158 150 161 #4«®©+1.47 164 1.45 167) 1.42 
30 154 159 152 163 1.49 165 41947 #=+167 144 
: re as 156 160 153 163 41.51 166 +149 #168 = 146 
90 1.57 1.61 1.55 164 153 166 1.50 1.69 1.48 
as 158 162 156 #165 #184 +9167 #2=+152 #169 «2150 
: 100 159 163 1.57 185 155 1.67 153 1.70 1.81 . 
—- —— 